PREFACE


Chapters 1 and 3 of this volume represent an innovation. They review topics not 
on the Master Plan but of much current interest. They were also prepared more 
speedily than the other chapters so that they could reflect the most recent devel- 
opments. We intend to include such special chapters in the forthcoming volumes 
and will welcome reactions to the chapters and to this policy. 

The Master Plan serves as a guide but not as a rigid prescription for the 
allocation of subjects in the successive volumes. Furthermore the Plan is subject to 
review and modification at each of the annual meetings of the Editorial Com- 
mittee. The current form of the Plan is given below, and comparison with the 
version in Volume 22, 1971, will show a number of important changes, both in the 
listing of topics and in the recommended frequencies of their occurrence. 

Authors are asked to review selectively within the area assigned. They are not 
expected to review a whole domain but to choose the most significant and impor- 
tant developments within that domain. Nor are they required to restrict their 
treatment to work of the last few years. In some cases, as will be seen in this present 
volume, authors select a specialized subsection of the topic for intensive review. 
Each volume thus emphasizes some topics while slighting others, but over a span 
of a few years the selection of topics and of authors is intended to give a rounded 
portrayal of current developments of psychological research and theory. The
Editorial Committee sees its main task as nominating highly qualified authors who 
are then given wide discretion, but limited space, for the preparation of their 
reviews. 

Paul Mussen, as outgoing editor, shared the editorial duties of Volume 25 
(1974); because of an error his name was not listed as an editor in that volume. It is 
a pleasure to have collaborated with Paul Mussen in the preparation of Volumes 
20-25 and to be sharing these duties now with Lyman Porter. 
THE PSYCHOLOGY OF
WOMEN: SELECTED TOPICS


Martha T. Shuch Mednick¹

Department of Psychology, Howard University, Washington, D.C. 20001 
and Hilda J. Weissman

Washington School of Psychiatry, Washington, D.C. 20009 


It is fitting that the first article on this topic in the Annual Review of Psychology 
be written in the inaugural year of the APA Division on the Psychology of Women. 
The Division has defined itself as dedicated to research on women; a charter 
membership of more than 800 must be viewed as one measure of current interest 
in this field. In practical terms it is too soon to look at the effect of the Division; we
hope that the reviews that follow this mini-survey will reflect its catalyzing function
and provide a more comprehensive view of the field than is now possible. The
study of many aspects of the female experience is accelerating, and the problem 
areas and questions raised touch on every substantive, methodological, and prac- 
tical domain of psychology. 

This is a report on recent work in selected areas. It is limited by space, by the 
frank biases of the authors, and by topical popularity. We will turn first to the 
psychological study of sex roles, fundamental to most research concerning women. 
The second topic concerns a new look at achievement motivation in women. 
While some segments of society are rejecting the achievement ethic, there is a 
growth of interest in patterns and determinants of women’s school performance 
and career development (4, 43, 54, 114, 134). The third topic concerns the impact 
of feminist thought and knowledge about female sexuality on the theories and 
practice of psychotherapy. The common thread throughout is a questioning of 
age-old assumptions about sex roles and sex identity. 

1 Weissman is responsible for the section on therapy and Mednick is responsible for choice
of topics and the remainder of the review. The authors wish to express appreciation to
Saundra R. Murray for her generous contribution of time, effort, and ideas, and to Leona W.
Levine for editorial and other assistance in the preparation of the manuscript.

The literature survey is current, stressing 1972 and 1973; however, selected works published
earlier than 1972 are mentioned, as are several appearing in early 1974 or still in press.

There is, and continues to be, a great emphasis on spelling out differences in
abilities and personality traits and their biosocial bases (123). Older and newer
works set in the frame of infant and childhood development have provided us with 
cogent reviews and thorough analyses (80, 81, 124). The controversies are not 
resolved but they have been well defined, and certainly this will continue to be a 
“hot topic” generating emotion and political action as well as research. This is not 
the main concern of this review; the focus is on women more than girls and on 
intragroup variations and individuality more than on sex differences. Sex 
comparisons cannot be and are not ignored—in fact, we may in the end realize that 
understanding women has led us to a clearer understanding of men and of people 
in general (22). 

Two recent texts on the psychology of women (6, 123) emphasized the biological 
approach as well as early socialization. The text by Sherman is a well-reasoned, 
analytical look at the empirical findings about females. More questions are asked 
than answered—an accurate reflection of the state of the art. Bardwick’s text is 
personal, presenting a biosocial theory of feminine development whose basis has
been questioned² (33). Bardwick has edited a book of readings (7) which also
stresses biology and early socialization (10). Other collections have taken a 
social-psychological approach (89, 90, 138). A comprehensive annotated bibli- 
ography of the interdisciplinary as well as international literature on sex roles is 
available (3). 


PSYCHOLOGICAL STUDY OF SEX ROLES 


Research and writing about sex roles have moved from description and an ac- 
ceptance of the givens (120) to a concern with the dynamics and implications of 
change (59, 60, 63, 76, 87). Compelling questions about the nature and mutability 
of social institutions such as marriage, the family, and even motherhood and 
fatherhood have been raised (14, 15, 56, 59, 116). Holter (60) presents an analysis 
of how social structures perpetuate even nonfunctional sex role distinctions, and 
spells out the requirements and consequences of role change. She is fully cogni- 
zant of the psychological factors which help maintain the status quo, such as the 
power of entrenched attitudes and behaviors and immutable self concepts. Similar 
points have been raised by Bem & Bem (13) in a social-psychological analysis of 
the influence and pervasiveness of sex role ideology. The implication of role 
change for men (135) and generally held conceptions of masculinity has received 
some attention (102), but little empirical work has appeared. 

Sex role variation has been studied in numerous societies (16, 18, 50, 61, 87, 113, 
116, 120). The comparisons help clarify the possibilities of change, but causal
generalizations are problematic. It does seem clear that even in societies with 
conscious women’s emancipation ideologies, role change is superficial (59, 88). 
Major social factors such as variation in birth rates, sex ratios, technological
changes, economic conditions, and national crises (75) may be more potent (even
if temporary) effectors than ideology or legislation, and these must be included in
any study of role variation over time and across cultures.

2 See also review by E. E. Maccoby on The Meaning of Being Female in Contemp.
Psychol. 17:369-72.


Sex Role Identity 


Discussions of sex or gender identity emphasize models of early socialization and 
their implications for later personality functioning and malfunctioning (e.g. 65, 68, 
93). Although critical of the content of Freudian thinking about the development 
of the feminine personality, Bardwick (6) adheres to classic psychodynamic 
assumptions regarding the significance of biology and early experience. Such as- 
sumptions (unleavened by considerations of environmental challenge) lead to 
pessimistic conclusions about the possibilities of sex role change, and their validity 
is being questioned. 

Challenge is also directed at the definitions and measures of many over- 
lapping sex role concepts. Indeed major integrative problems stem from the 
vagueness of the constructs and capricious variation in measurement. For 
instance, Constantinople (26) addressed part of this problem in a thorough review 
of the literature on masculinity-femininity (MF). She noted the definitional am- 
biguity which was not resolved by the operational recourse to “items which 
separate men from women.” Major measures of MF confuse or merge the con- 
cepts of sex role identity, sex role preferences, and sex role adoption. Indeed MF 
measures are also called measures of sex-typing (17) and sex role standards (35, 
130). Constantinople concludes that most MF instruments are multidimensional, 
including various affective and cognitive components, and moreover rejects the 
assumption that MF is a bipolar trait. Thus evidence suggests that masculinity and 
femininity are not opposites; each should be conceptualized and measured inde- 
pendently. This is consonant with Carlson’s (22) view that characteristics viewed 
as masculine and those regarded as feminine may co-exist and interact. Helson’s 
(e.g. 55) studies of the creative personality focus on such a conception and indicate 
that effective creative functioning in men and women may be best understood in 
these terms. 

Other recent research is also addressed to the bipolarity controversy. J. H. Block 
(16, 17) presents a model of sex role conceptions and provides evidence, based on 
cross-national and longitudinal data, that the most effective functioning is a 
product of the incorporation of positive aspects of both the masculine and fem- 
inine sex roles. However, she found that the socialization process enhances this 
trait co-existence for men but not for women. Thus, “For males, socialization 
tends to enhance experiential options and to encourage more androgynous sex 
role definitions ... for women the socialization process tends to reinforce the 
nurturant, docile submission and conservative aspects of . . . the role... discour- 
age self assertiveness, achievement orientation and independence” (16, p. 525). 

Similar results are reported by Spence and her colleagues (130), who found that 
for men and women the endorsement of both highly valued masculine traits and 
feminine traits correlated positively with ratings of self esteem. S. L. Bem (11, 12), 
who like Spence et al (129) is engaged in the development of a new instrument, 
found that androgynous individuals are more likely to behave adaptively, i.e. do 
what the situation rather than a rigid sex-role self-concept demands. She also
noted a “behavioral deficit” in the most feminine females. That the female role is 
related to indicators of stress, conflict, and incompetence has been demonstrated 
in several other recent studies (48, 66, 105, 109, 147). It seems quite safe to predict 
that the Zeitgeist will combine with the stimulating force of new techniques and 
alternative theories to generate further empirical work on the redefinition and 
refinement of these concepts. 


Stereotypes 


An extensive series of studies conducted by Broverman and her colleagues has 
become a standard reference in the research and in popular feminist literature. This
work has even been referred to and inserted in hearings on sex discrimination 
before the Congress of the United States. In a review (20) of their own work, these 
investigators concluded that stereotypic thinking about sex role related personality 
traits is pervasive. Furthermore, a greater number of desirable traits are assigned 
to men than to women, the valued traits for men forming a competence cluster, 
while those for women form a warmth-expressiveness cluster. 

Stereotypic thinking and adherence to traditional role expectations do appear
to be pervasive. It is found in school settings (5, 36, 78, 79, 101, 115), in children’s 
literature (143), in U.S. as well as Israeli communes (88, 92, 148), in widely used
textbooks (8),³ in the mass media (107), and in language styles (141). The
occupational world is sex-typed (70, 100, 125); specific job-sex patterns vary across 
cultures, but prestige and economic value are always higher for those which are
male occupied (59, 88, 116). Occupational prestige levels change when the other 
sex moves in; this has been recently verified in a laboratory setting (137). The 
perceptions of required and personal job characteristics or requirements also 
reflect sex role assumptions (119). 

There is some work suggesting that stereotypic sex role attitudes held about 
others or the self are significant moderators or determinants of various classes of 
behaviors. These include occupational choice (72, 75), attraction (121), activism 
(118), and achievement (2, 77, 101, 136; also see section on achievement). Several 
studies indicate that sex role considerations may account for variations in helping 
behavior and that the nature of the situation interacts with biological sex and sex 
role in the elicitation of help. Men and women helpers are equally affected (47, 52, 
139). 

Ellis & Bentler (35) report that women but not men who perceive extreme sex 
differences endorse traditional role standards. Additionally, women who scored 
high on a measure of status seeking regarded themselves as nontraditional—that is, 
different from women in general. It is also of interest that women like women 
depicted in masculine but not feminine role activities, even when they are incompetent.
This is most true for women with pro-feminist attitudes. Men on the
other hand did not show this “pro-masculine bias” (128) and liked competent
women regardless of the role enacted.

3 A report on sex bias in psychology textbooks is available from the American Psychological
Association, Office of Educational Affairs (unpublished manuscript, J. Birk et al).

Women’s views of men’s opinions affect their behavior (53, 62), but there is 
some evidence that men don’t think what women think they think (132). With 
regard to opposite sex attraction, women were attracted to “masculine” males, 
while men were equally attracted to “masculine” and “feminine” females (121). 
Males’ response to women depicted in roles varying in degree of “liberatedness” 
was examined in a study by Dufresne (32). Liberated women aren’t “liked,” but 
they are assigned higher status jobs than the neutrally regarded traditionals. 
Equalitarian women fare the best; they are liked and given high status assign- 
ments. Studies of this type should be repeated with different role incumbents, 
since the actor’s personal characteristics may influence judgments. Another question
concerns the subject populations: will men (or women) actually doing the
hiring respond as college sophomores do? Also critical is a look at the extent to
which attitudinal statements and judgments about videotaped or otherwise simu- 
lated role incumbents relate to behavior. Such research is needed before we can 
conclude that women are totally misguided in their beliefs about “What Men 
Want.” Finally, questions about the reactions of women to variations in male role 
behavior as well as men’s concerns about what women think have yet to be 
explored. 


ACHIEVEMENT TENDENCIES IN WOMEN:
IS THERE A NEW LOOK?


In their article on personality in the 1971 Annual Review of Psychology, Sarason & 
Smith singled out the area of achievement motivation as one which has been based 
almost exclusively on the study of men. This practice was accurately attributed to 
the conclusion by McClelland and his colleagues that women need approval and 
men need success. It should be added that neither McClelland nor Atkinson nor 
their offspring chose to identify their work as the psychology of male achievement 
motivation. The message was left between the lines. Sarason & Smith noted, “In 
view of evolving changes in cultural conceptions of female roles and aspirations, 
greater contemporary research on achievement motivation in women seems war- 
ranted” (118a, p. 409). The view from early 1974 indicates that this is happening; 
the following reviews a portion of recent work. 

Two review papers framed in different theoretical contexts focus on the 
affiliation-achievement conflict. Hoffman (57) asserts that females are inade- 
quately socialized for independence and thus do not develop adequate mastery 
skills. She argues, too, that female achievement behavior is motivated by a drive to 
please rather than to succeed and that success and affiliation are in reality often 
incompatible. A specific prescription for mastery training for girls is suggested. 
However, Stein & Bailey (131) conclude that girls and women are achievement 
oriented but learn to value sex role appropriate social skills. The basis for learning 
is the same for boys and girls: social reinforcement, i.e. response to approval. They 
reject the hypothesis that males and females are different in level of affiliative
need, basing much of their argument on evidence that boys and girls are not 
differentially responsive to social approval or disapproval in a variety of social 
learning experiments. Stein & Bailey and Hoffman agree that certain kinds of 
achievement behaviors are incompatible with the typical feminine sex role training
pattern, and that independence and mastery training must be facilitated (also
124).


Fear of Success 


The fear of success (FOS)⁴ was postulated by Horner (62) within the framework of
Atkinson’s expectancy-value theory to account for sex differences in achievement 
behavior. She developed a fantasy measure of the motive that accounted for 
performance decrements by high ability women in mixed-sex competitive 
achievement settings. The idea was that women were motivated to achieve but that 
fear is aroused by the anticipation of negative consequences of success. It is not a 
wish or need to fail. The latter concept, which one writer does introduce⁵ and
at least one study purported to examine (85), would imply an expectation of posi- 
tive consequences from failure in achievement situations. 

Recent studies of FOS have looked at variations in incidence of imagery and 
have generally attempted to explore the validity of the concept. Although Horner 
noted an increase in FOS in women from 1965 to 1971, others report lowered 
incidence or diminished sex differences for a variety of settings and age groups (21, 
37, 58, 104, 112, 140). That social factors affect degree of FOS imagery is suggested 
by several studies of black women (87, 106, 145). Hoffman found no sex differ- 
ences, but women did show as much FOS in 1971 as in 1965. Thematic analysis 
suggested that college women are as fearful as ever but that college men now seem 
to be downgrading success (58). A study of students in the fifth through eleventh 
grades reported similar thematic differences (21). On the other hand, Ward & 
Mausner (140) found that their high school students told few affiliative rejection 
stories. Monahan et al (94) found high FOS in women, but both men and women 
told FOS stories for successful women but not for men. They suggest that this 
supports a learned social role rather than motivational explanation for the con- 
cept. Horner and others (21, 58, 112) have attributed variations in both male and 
female imagery to sex differences in scoring, the changed social climate, subject 
characteristics, and stimulus properties. To date, however, the results are such that 
none of the explanations are consistently supported. 

4 Margaret Mead referred to women’s fears of success in Male and Female, published in
1949. Klinger postulated the same concept in another context (69).

5 J. E. Albino (unpublished manuscript, 1974) makes the provocative suggestion that this is
analogous to the Freudian concept of feminine masochism, but also notes the absence of
evidence for such a motive.

Other investigations have attempted to verify predictions about the effects of
variations in FOS. The hypothesis that opposite sex competition leads to a performance
decrement for women is supported by several studies (62, 86, 104, 144)
but not by others (21, 27, 127, 140). Sex role traditionality was found to moderate
the expected relationship of FOS and performance in one study (104) and was the only
significant predictor in another (103). Sorrentino & Short (127) found that high FOS
women performed better on a male sex-typed task than a “female” task. This is not 
only contrary to Horner’s predictions, but also to other work (131) regarding 
sex-typing and achievement oriented behavior. The nature of the task was also 
explored by McGuinness (86) in the only study that varied difficulty level. A 
performance decrement due to mixed sex competition was related to FOS for both 
men and women, but only on a task of moderate difficulty. This suggests that the 
study of the effect of task variables is a fruitful direction for further research. 

Horner’s theory also predicts that high ability women are most likely to have 
FOS. The complexities of predicting actual achievement from individual differ- 
ences in motivation, expectancy, or attributional measures have been thoroughly 
discussed in the achievement motivation literature (108, 142). Direct tests of the 
relationship between ability and FOS do not support the theory (71, 104), but 
several studies may be cited as showing indirect support (21, 99, 127). 

Peplau’s (104) study of dating couples appears to reject Horner’s prediction that 
the attitudes of a boyfriend and perceptions of his supportiveness are negatively
related to FOS. A study of black women also rejected this hypothesis, but found 
that the degree of attachment to a man (e.g. dating vs engaged) moderated the 
relationship of FOS and attitudes about black militancy. Thus attached women 
who had militant attitudes had less FOS than their unattached counterparts (106). 

Horner’s construct has indeed stimulated an energetic “new look” in one direction,
but it is clear that refinement of the measure⁶ and the concept should be
encouraged so that adequate testing and modification of the theory may proceed. 


Expectancies and Causal Attribution 


The general literature on achievement motivation has burgeoned with studies 
emphasizing cognitive factors as influencers of achievement behavior (38, 142). 
The research on sex differences in expectancies and causal attributions about 
success and failure has been summarized by Frieze (42; see also 131) and also 
suggests promising new directions for research on women and achievement. Many 
have argued that motivational research reinforces victim rather than institutional 
blame (112). However, it is also important to recognize that self blame is generated 
through certain types of misattributions. Gurin & Gurin (49) have suggested that 
the effect of social change on expectations about success and failure and behav- 
ioral change must be based on a complete understanding of the self-attributional 
process. They suggest that this process may provide a linkage of institutional 
change to individual response. 

Recent reviews and new findings (29, 40, 42, 131) clearly document lower 
expectations for female than male performance on tasks involving intellectual 
mastery. We have already seen that tasks and traits typed as female are devalued.
Expectancies about one’s own performance are also low; women generally expect
to fail (42, 131).

6 A new scoring system was demonstrated by Horner at the American Psychological
Association, 1973.

There are also sex differences in causal attributions for success and failure (42). 
Men generally attribute their success to stable factors such as ability and failure to 
luck which is unstable. Women, on the other hand, tend to attribute failure to 
ability and success to effort. However, these relationships are moderated by 
individual differences in level of achievement motivation (42), fear of success (39), 
ability level, and the relationship of expected to actual outcomes (126). Using 
Weiner’s (142) attributional analysis of achievement motivation as a conceptual 
framework, Frieze (42) derives practical implications for women’s achievement 
efforts from these findings. 

The importance of sex role considerations was shown in a study of causal 
attributions about others’ performance (29). It was found that both men and
women rated men as more successful than women on “masculine” as well as on 
“feminine” tasks. On masculine tasks good performance by a man was attributed 
to skill, while good performance by a woman was attributed to luck. Furthermore, 
performance on a masculine task was valued more highly than on the feminine 
task. In the same vein, Feldman & Kiesler (40) report that “those who are number 
two try harder.” When evaluating peers, subjects attributed women’s success to 
hard work and failure to lack of effort. This was not the case for the attributions 
about men’s performance. In a second phase of the study they found that men and 
women were expected to be equally successful as pediatricians but not as surgeons. 
Again, woman’s success was attributed to her hard work rather than her ability. As 
these authors note, other findings (38) indicate that expected outcomes tend to be
attributed to stable factors such as ability, and since men are expected to be
successful, such attributions are probable. In contrast, variable factors such as
effort are used more often to explain unexpected outcomes—such as woman’s 
success. Self attributions and attributions about others seem to go hand in hand, 
and both reflect expectations based on sex role assumptions. 


WOMEN AND PSYCHOTHERAPY 


Criticism of the Freudian theory of female psychosexual development has been 
simmering along since it was first presented. The theoretical issues and their 
consequences for women in “real life” as well as in psychotherapy surged to a full 
boil in the late 1960s. The assault on the validity of the theory and discussions of its 
influence on therapists and on the Zeitgeist, as well as its consequences for women 
in or out of emotional difficulties has continued to grow in both volume and 
intensity. However, as can be seen from the following summary, there is more 
speculation than empirical work. 

Rice & Rice (111) write of the perplexity and lack of empathetic understanding 
of male therapists concerning their women patients’ complaints and dis- 
satisfactions. Explaining women’s resulting hostility as transference has be- 
come less satisfactory. 

The feminist issue in psychotherapy has been stated in extreme terms. It is 
Chesler’s (24) thesis that men (including men therapists) drive women crazy, in
order to maintain a male sexist hegemony. The Brovermans (19) demonstrated 
that clinicians view “mentally healthy women” as passive, emotional, subjective, 
etc—characteristics they otherwise attribute to “mentally unhealthy adults.” Does 
this pressure their patients into diminishing or even self-destructive roles? It seems 
that therapists support sex-role stereotypes and the status quo in general (1, 83). 

There is strong feeling in counter-cultural and radical feminist circles that 
psychotherapy in general, and certainly the psychotherapy establishment, is de- 
structive to the human potential of women. Various groups have established lists 
of accredited, feminist therapists, who are considered to be freer of exploitative 
bias toward women than establishment therapists, although the categories 
overlap. There has been a proliferation of therapy substitutes for women: 1. 
consciousness-raising groups originally developed by the National Organization 
for Women, but now burgeoning in various other institutions; 2. assertiveness- 
training groups, generally using behavior modification techniques such as 
modeling, role-playing, desensitization, etc; 3. continuing education programs at 
colleges and universities, which combine vocational and sensitivity training; 4. 
encounter and sensitivity training groups focusing on women; and 5. associations 
of para-professionals stressing supportive, assertive, and confrontational methods. 

Polemic aside, what does the literature reveal of professional response to the 
issues raised? There is much theoretical discussion but little systematic develop- 
ment or research on theory or practice. A survey of the research literature on 
psychotherapy and mental illness relating to women, and female sexuality, reveals 
a heavy predominance of interest in two areas: abortion and frigidity. In a 
summary of medical and psychiatric attitudes about psychiatric indications for 
induced abortion, White (146) asserted that individual physicians held rabid 
opinions about criteria but were in total disagreement. He says, “Is it possible that 
the pregnant woman symbolizes the proof of male potency and that if we loosen 
our rule over women and grant them the right to dispose of that proof when they 
want to, we men then feel terribly threatened lest women can at will rob us of the 
proof of our potency and masculinity?” 

Turning to the literature on frigidity, most articles deal with it as a dysfunctional 
symptom resulting from ignorance, fear, hostility, etc. With regard to treatment, 
several papers focus on the use of behavior modification (e.g. 34, 64), hypnosis 
with and without stag films (9),⁷ and similar more or less mechanistic techniques
(31). While the psychoanalysts do look at frigidity in relation to the woman’s life 
history and pattern, it is still seen as a pathological symptom, never as a conflicted 
attempt to control her anatomical destiny. Fisher (41) has written a large book on 
The Female Orgasm, which has been criticized on this as well as other grounds by 
Money et al (95). An alternative way of looking at frigidity, similarly conjectural, is 
in terms of existential conflict rather than inconvenience or reproach to the male, 
or defect or failure in the female. Sherfey (122) has observed that “each orgasm
tends to increase pelvic vasocongestion,” hence readiness for further orgasms.
Since most sexual partners could not meet the woman’s level of repeated readiness,
her “frigidity” could be seen as the self-protective outcome of an
approach-avoidance conflict, whereby she avoids chronic pelvic congestion, a
condition which in fact does occur in many women (and which Freud considered
the basis of anxiety neurosis). Even if a tireless partner were available, the prospect
of reversion to an instinct-dominated existence, where every orgasm led to more
until a state of physical exhaustion was reached, also conflicts realistically with a
human range of goals.

7 Sales brochure for slides and tape presentation by Gilbert MacVaugh, Frigidity: Successful
Treatment in One Hypnotic Imprint Session with the Oriental-Relaxation Technique.

Other popular topics are represented by the many clinical reports on contra- 
ceptives for women’s use, on treatments with psychotropic drugs, and on the 
effects of gynecological surgery on women’s sex life. The charge that therapists see 
women as functional or dysfunctional objects rather than as humans (see 25, pp. 
47, 73, 75-82) seems to have some foundation in the selection of such topics for 
communication to one’s peers about one’s work with women patients. It is also 
true that there are increasing numbers of articles sensitive to women’s emotional 
problems in realistic contexts (91). 

The remainder of this article will deal with a few promising developments and 
more critical rhetoric in psychoanalytic thought, which still seems to be the most 
influential theoretical basis for treatment situations in clinics and private practice. 

Of the numerous articles calling for the revision of psychoanalytic theory and 
practice, one by Kronsky (73) is outstanding: soundly based in psychoanalytic 
understanding, free from rhetoric, and specific in its recommendations to the 
therapist. Kronsky sees women as frequently suffering from a free-floating sense 
of guilt resulting from inconsistent super-ego prohibitions or a double bind. She 
urges that the therapist avoid interpretation or mirroring while the patient is 
venting her rage, her competitiveness, hatred, and envy of men, lest these tech- 
niques result in re-repression, with these feelings becoming ego-dystonic, and an 
inauthentic femininity covering all. Moulton (97) speaks in similar terms and 
amplifies these themes. Chasseguet-Smirgel (23) has collected a book of articles by 
French women analysts who regard fulfillment as covering a broader range than 
matehood and motherhood. 

Some male analysts have also spoken out on various aspects of the weakness of 
orthodox analytic theory in dealing with women’s development. Gelb (45) states, 
“I believe that many in the field of psychoanalysis have been guilty of contributing 
heavily to the coercive institutionalization of male and female roles which have 
little to do with sexual identity.” While women and men have been struggling 
toward competence and relatedness, many psychoanalysts discourse about the 
glory of passivity to women and about “reestablishing masculine identity” to 
men—rather than moving toward full capacity as human beings. Chodoff (25) 
points out that the evidence that Freud got from his patients for his theory of 
infantile sexuality can be explained by the demand characteristics of the doctor- 
patient relationship. Chodoff states, “I submit that the role attributed to infantile 
sexuality in general and to penis envy in particular ... cannot be proved on 
biologic and other empirical grounds, no matter how widely and repetitiously they 
are given the status of unassailable truths in large segments of educated opinion in 
the United States.” If the evidence for infantile sexuality is not convincing, Freud’s 
theory of feminine psychology needs to be revised. Salzman (117) underlines the 
need to recognize that “the psychology of the female ... is determined by her 
inherent biological potentiality in a changing cultural matrix.” He stresses the 
importance of religious doctrines and technological limitations in presumptions 
about woman’s natural role. Marmor (84) observes that the fallacy in the common 
argument that the male’s aggressivity and the female’s passivity are both “natural” 
lies in the confusion between what statistically describes the behavior of men and 
women and what motivates that behavior. 

Miller (91) holds that psychoanalysis is now at a turning point, the old sexual 
theories having failed to solve people’s problems in living. She points out that “the 
notions of renunciation and disengagement inherent in the ‘masculine’ growth 
model have meant limitations on full human development for men and women 
alike.” 

The onslaught against Freudian theory received its greatest recent impetus 
from implications of the work of Masters and Johnson and of Sherfey (122). The 
distinction between “clitoral” and “vaginal” orgasms had been a linchpin of the 
theory of feminine sexuality, which was supposed to develop in two stages. The 
first was an immature “phallic,” “masculine” stage where the female’s enjoyment 
of her clitoris reflected her penis envy and rebellion against her inferior sexual 
equipment and castrated state. This had to be given up for the attainment of the 
second stage, a stage of “feminine” orgastic experience based on the passivity of 
the vagina and acceptance of her castrated state. Masters’ and Johnson’s studies 
have been accepted as demolishing the Freudian distinction between “clitoral” 
and “vaginal” orgasms in women, while Sherfey’s integration of major contribu- 
tions from embryology, biology, and gynecology has cast doubt on Freud’s 
depiction of female sexuality as a pallid counterpart of the male’s, and only per- 
ilously achieved at that. She started with the previously overlooked finding by 
embryologists that all mammalian embryos are now known to be anatomically 
female in the early stages of life—until the sixth week of fetal life in the human— 
when, in the presence of the Y chromosome, fetal androgens bring about the 
differentiation of the male from the female anlage. She then went on to theorize on 
the basis of endocrinological research that early human females developed an 
inordinate cyclic sexual capacity which must have been a major barrier to the 
evolution of modern man, to the extent that it interfered with maternal care. 
Feminine hypersexuality therefore had to be suppressed, first in the interest of 
longer and more richly developmental human childhood, and then in the interest 
of property rights and ascertainable lines of descent after settled agricultural 
economies came into being. Although the race needed the suppression of female 
hypersexuality, one might guess that the individual male suffered from the 
comparison with this female, especially before the knowledge of paternity. The 
uneasy attainment of this giant step for mankind may have had to be buttressed by 
the development of the compensatory hypermasculinity which has characterized 
so many human societies. 

Freud postulated that the development to psychosexual maturity was more 
difficult and uncertain for women, since their first love object was of the same sex, 
whereas males have heterosexual objects from the beginning. With the male, 
heterosexuality was supposed to be primary, while for the female it is only 
achieved after tortuous changes. Stoller (133) offers other observations and 
thoughts in connection with Freud’s postulation. Before the mother becomes the 
male infant’s love object, there is a stage of symbiosis when he has not yet 
separated from his mother and distinguished his body and psyche as differ- 
ent from hers—and “she is female with a feminine gender identity.” Therefore, 
heterosexuality is not a given for males, but an achievement, whereas femininity 
for females is given by the earliest identification, not tortuously arrived at through 
castration, humiliation, renunciation as Freud said. Symbiosis anxiety is seen as an 
important motive force in the development of masculinity. 

Gadpaille (44), in a review of physiological and other research, wrote in 1972: 


Perhaps even male chauvinism and the worldwide predominance of male domination 
... (is) intensely motivated as a denial of an unconscious awareness by man of his 
greater fragility and vulnerability, both physically and psychosexually, compared to 
women. ... The male is more vulnerable in virtually every way with the exception of 
physical prowess: there is greater difficulty in achieving maleness and masculinity, 
coupled with an increased vulnerability of males to disruption of sexual function and to 
physical and psychosexual disorders. 


In confirmation, there is the 1972 publication by Ginsberg, Frosch & Shapiro 
(46) observing that the obsolescence of the virginity taboo has disrupted the 
former ecological balance in society resulting in disequilibrium for males, and the 
rise of what they call “the new impotence.” 

Madigan (82) found sex mortality differentials to be biologically caused, not the 
result of social conditions. He found that both sexes equally lack resistance to 
infectious and contagious diseases, but that women have greater constitutional 
resistance to degenerative diseases. 

Paralleling the critique of Freudian theory regarding female development and 
malfunctioning is a revival or shift of interest to Jungian psychology with its stress 
on the existence of both masculine and feminine polarities in each individual (30, 
51). The research referred to in the earlier section on sex role definitions is also 
germane. 

Various streams of thought have been flowing together in the last few years 
which suggest a need for far-ranging reorientation of the androcentric epistemol- 
ogy of our culture if a psychological understanding of women and men is to be 
developed which can bear reality testing. Morgan (96) offers a sparkling alterna- 
tive to the androcentric evolutionist theories. She says: 


I believe the deeply rooted semantic confusion between ‘man’ as a male and ‘man’ as a 
species has been fed back into and vitiated a great deal of the speculation that goes on 
about the origins, development, and nature of the human race.... A very high 
proportion of the thinking on these topics is androcentric ... in the same way as 
pre-Copernican thinking was geocentric. ... One of these days an evolutionist is going 
to strike a palm against his large-domed head and cry: ‘Of course! We assumed the 
first human being was a man!’ 


Daly (28) proclaims the need for a counteraction to patriarchy’s naming of 
reality and morality as in the story of the Fall of Adam and Eve, whereby the 
“male’s viewpoint is metamorphosed into God’s viewpoint” (28, p. 47). According 
to myths, women are weak and evil (98). Theodor Reik (110) discusses the myth of 
the Creation as a denial of the reality that man is born of woman, not vice versa, 
and also in terms of what Stoller called symbiosis anxiety. Lederer (74) also goes 
into prehistory and paleoanthropology to decipher the counterphobic denigration 
of women in patriarchal society. There has been renewed interest in what can be 
learned of the genesis of androcentric psychoanalytic theories from the sociology 
of knowledge (67). We are clearly at a point where psychoanalytic theories of 
feminine development can thrive only with a transfusion from other disciplines: 
biological, historical, economic, and sociological. 


CONCLUDING COMMENTS 


This selective review on the psychology of women has, as we predicted at the 
outset, raised many questions. It is perhaps banal, but nevertheless necessary, to 
stress that there are unresolved conceptual and methodological issues within each 
area which will have to be raised again and reevaluated as the field develops. The 
different styles of the two authors probably reflect the differing state of the art in 
their fields. Still, if a major theme can be discerned, it is that of the sexual division 
of personality characteristics based on the male thinker’s view of reality. To the 
extent this division is accepted, individual women, and men to a lesser degree, are 
constricted in their personal fulfillment, and society is hobbled in both compe- 
tence and relatedness. 

Five trends in hypnosis research, theory, and practice are notable since the 
previous review of 10 years ago (85). They are listed briefly here, and the docu- 
mentation given in the body of the review. 

First, the controversy over the utility of the concept of “trance” or “hypnotic 
state” is kept alive by those who oppose the concept, but the issue has not proved 
to be a fruitful one. Only the nontrance advocates believe that they have intro- 
duced a new paradigm; those who seem to them to be committed to the trance 
concept commonly use the term in a phenomenal or descriptive way, without 
giving it the centrality that the opponents expect of the proponents. The residual 
arguments are in fact commonly over problems of experimental design and data 
analysis, having nothing to do with the trance concept as such; the differences are 
not paradigmatic but methodological. The recent introduction of neodissociation 
theory now offers an interpretation in terms of multiple cognitive controls whose 
hierarchical relationships may be modified by hypnotic procedures. Because these 
control systems can be fractionated in various ways, and are not as pervasive as a 
general “hypnotic state,” the neodissociation interpretation may substitute in 
some respects for the trance concept, while recognizing that the hypnotized 
subject is indeed in some way altered. 

Second, there is a convergence of all investigators upon the role played by the 
subject’s imaginative and fantasy productions. This is reflected in the supple- 
mentation of objective indices of responsiveness to suggestions by subjective 
measures derived from questioning the subject, and by a reduction in the emphasis 
given to conclusions based on superficial performances that are readily modified 
by situational variables. 


1 The survey of literature pertinent to this review was concluded in February 1974. Because 
this review deals with the literature of 10 years, covering material since the preceding review 
(85), it has been necessary to be highly selective. I estimate that 200 relevant journal articles 
and books now appear in the scientific literature each year, so that limiting the reference list 
to 200 titles means that only 1 in 10 can be cited. 

2 The review was aided by Grant MH-3859 from the NIMH. 

Third, the increasing emphasis upon imaginative involvements and absorption 
calls attention to enduring individual differences in these abilities. The relative 
persistence of a subject’s level of hypnotic responsiveness through time and over 
the varying demands made of him in experimental contexts can now be inter- 
preted according to the roots of these fantasy involvements: influences upon him 
originating in childhood, and possibly a contribution through hereditary potential. 
Because the major source of variance in hypnotic responsiveness lies in these 
persistent individual differences, appropriate experimental designs are now taking 
them into account. 

Fourth, recent advances in brain psychophysiology give added support to the 
above basis for individual differences. Studies of hemispheric laterality of function 
now indicate that hypnotic responsiveness is associated with a preference for right 
hemisphere function in well lateralized right-handed subjects. This finding is 
coherent with the role of the right hemisphere in imagination. 

Fifth, the clinical applications of hypnosis in pain reduction and in psycho- 
therapy have become more sophisticated, and hypnosis is now used either alone 
or in conjunction with other types of behavior modification. The clinical findings 
have a feedback relationship with laboratory findings, and serve to correct exag- 
gerated claims for or against the efficacy of hypnosis. 


THE TRANCE CONTROVERSY 


The puzzling problem of the existence and nature of one or more hypnotic states 
has been magnified into a controversy by those who describe themselves as 
nonstate theorists and believe that by rejecting the state or trance concept they 
have introduced a new paradigm into the investigation of hypnotic-like behavior 
(e.g. Barber 10, 12; Sarbin & Coe 164). Such a polarization of standpoints has both 
its positive and its negative aspects: positive in that it requires all parties to clarify 
their concepts, negative in that it leads to defending a priori viewpoints rather than 
to engaging in a more open inquiry regarding the answers to perplexing problems. 


Three Issues Intertwined 


The arguments over state and nonstate center around three polar concepts that are 
not clearly distinguished in much of the controversial discussion. The first pair of 
concepts can be designated the trait vs state distinction, that is, whether hypnotic 
responsiveness is more a matter of the characteristics of the subject than of the 
state produced by hypnotic induction. This is subject to empirical answer, and the 
bulk of the evidence, so far as this issue is concerned, favors trait over state. The 
second of the polarities has to do with the relative importance of traits vs situa- 
tional factors. While the question of state is not at issue here, a preference for the 
influence of situational factors is evidenced by those who object to the state 
concept, reflecting similar controversies in contemporary social psychology (27). 
Here again the empirical evidence favors trait over situation, in that baseline 
responses to hypnotic suggestions contribute more to the variance of hypnotic 
responses than do modifications in the experimental manipulations (181). The 
third dichotomy is that between state and situation. Granted that an experimental 
and control group are equated for their individual differences by random selec- 
tion, hence alike in “traits,” does hypnotic induction (presumably producing a 
“state” according to state theorists) succeed any better than nonhypnotic manip- 
ulations in producing hypnotic-like responses? The evidence increasingly favors 
some enhancement of hypnotic-like behavior as a consequence of induction, but 
the enhancement is not great and the theoretical problems are not simple, so that 
this is the least resolved of the three issues. 


Original Task Motivation Instructions (OTMI), with False Reference 
Standards, in Support of the Nonstate Position 


The thread that more than any other runs through the studies in Barber’s labora- 
tory from 1962 to about 1972 is the demonstration that task-motivation instruc- 
tions, administered without hypnotic induction, yield essentially the same results 
in responsiveness as a standard hypnotic induction as measured by the Barber 
Suggestibility Scale (BSS), hence opposing the need for a concept of hypnotic 
state. In order to examine this claim on empirical grounds, it is necessary first of all 
to clarify what is meant by these task-motivation instructions. In their original 
form they do more than exhort the subject to do his best; instead they provide false 
reference information through statements such as “everyone can do this if they 
really try. I myself can do it quite easily and all the previous subjects that partici- 
pated in the experiment were able to do it” (Barber & Calverley 13, emphasis 
supplied). Because not all subjects have done the things suggested, this is false 
information, and strongly invites the subject to comply lest he feel himself to be a 
deviant or a failure. 

Bowers (24) surmised that subjects confronted by such extreme compliance 
demands might report experiences about which they felt unsure in order to avoid 
acknowledging that they were deviant or uncooperative. He tested this conjecture 
by having subjects interviewed by a different experimenter after visual and audi- 
tory hallucinations were requested following the original task-motivation in- 
structions (OTMI). The new experimenter used “honesty instructions,” explaining 
to the subjects that science would be served by their being absolutely honest in 
reporting what they had experienced. He found what he had anticipated—that 
many subjects corrected their reports of hallucinations downward, denying that 
they had experienced them as vividly as reported. Spanos & Barber (181) supple- 
mented Bowers’ experiment by determining whether or not similar corrections 
would be made if “honesty instructions” followed hypnotic induction. They 
confirmed Bowers’ findings for visual hallucinations, showing not only that cor- 
rections were made following OTMI procedures, but that they were significantly 
more extreme than those following hypnotic induction. In other words, hypnotic 
induction in its standard form does not place as much pressure for compliance on 
the subjects as OTMI procedures do. The findings with auditory hallucinations 
were more ambiguous. As a consequence of these studies, it turns out that hyp- 
notic induction and OTMI are not alike in their consequences, and all the earlier 
studies in which they are compared on the basis of BSS scores, uncorrected for 
honesty, are to that extent less convincing. Barber’s laboratory has now adopted 
revised task motivation instructions (RTMI), in which the misinformation has 
been deleted. The new RTMI may take the form: “Other students were able to 
think of their hand in this way (as insensitive to pain) and it isn’t as hard as it 
seems” (Spanos, Barber & Lang 184). With these RTMI the “honesty correction” is 
no longer significant, but of course the results cannot be used to justify assertions 
based on the OTMI procedure. 


The Real-Simulator Design in Support of a Trance Concept 


At a descriptive level, subjects behave differently when hypnotized and when not 
hypnotized (153), and they report changes in the way they feel (198, 198a). This is 
sufficient reality to permit the experimenter to do his research and the clinician to 
use hypnotic procedures without their becoming embroiled in the state issue. 

One investigator in particular has attempted to delineate the changes in behav- 
ior that the hypnotic state is responsible for. Orne, in a series of papers, has 
proposed for this purpose the real-simulator design (136, 137). The responses of 
highly hypnotizable subjects (“reals”) are compared with those of insusceptible 
subjects who try to act as they believe highly hypnotizable subjects would (“sim- 
ulators”). Even if real and simulator perform alike it does not disprove a state 
concept, but it leaves the matter open for other evidence. What Orne hopes to find, 
however, are some differences that go beyond the obvious or implied demands of 
the hypnotist, something that hypnosis adds to these demands, not shown by the 
simulators. 

Occasionally significant differences are found between reals and simulators (51, 
138, 140), but the method is not universally successful in finding such differences 
(41, 135). Sometimes simulators overreact, as by having too much posthypnotic 
amnesia when that has been suggested (212). 

The real-simulator design has been subjected to a number of criticisms: the reals 
and simulators come from different populations, and perhaps the differences 
reflect “traits” rather than “states” (10); the groups receive different instructions, 
and hence are playing different roles (164); in its very effort at objectivity, the 
method falls into some of the traps of the nonstate theorists (23, 26). It is avowedly 
a design with severe limitations, which Orne recognizes, but when properly used 
it tells us a good deal about hypnosis (172). 


THEORIES OF HYPNOSIS 


The arguments over trance are only indirectly related to theories of hypnosis 
because hypnotic-like behavior has to be explained in detail regardless of how it is 
conceptualized in general. 


Conditioning Theories 


Explanations in terms of learning have been favored in psychology for many 
years, encouraged by the conditioning explanations of Pavlov, Watson, Hull, 
Skinner, and many others. Although earlier applications to hypnosis were made 
by Pavlov and later by Hull, Edmonston has been a more recent spokesman for the 
conditioned response interpretation (47). Another effort to use conditioning con- 
cepts consistently has been made by Barrios (16) to explain responses to posthyp- 
notic suggestion as higher-order conditioning. A total theory, with more than 
plausible analogies with conditioning, has not been developed. 


Input-Output (“Operational”) Theory 


Stimulus-response concepts were originally derived from the reflex arc and elab- 
orated in conditioning theories. Broadened out by functionalists into a concep- 
tion of antecedent and consequent variables, the contemporary derivative of these 
early positions has become an input-output logic that can be applied to various 
forms of information processing or situation-behavior sequences. Barber (10) has 
adopted this general logic for the interpretation of hypnotic behavior, seeking to 
find those antecedent variables (task-attitude, task-expectancy, task-motivation, 
tone and wording of suggestions, and tone and wording of inquiry) that deter- 
mine the responses that are recorded. His omission of individual differences in 
base rates seems to arise because these are hard to conceive as antecedent vari- 
ables; base rates have to be measured as consequents. In a formal sense, Barber’s 
theory is a set of experimental procedures for assuring reproducibility of results. 
He does not order the data according to any explanatory principles because, in 
effect, he believes that there is nothing unusual about hypnosis to require ex- 
planation. More recently, as goal-directed fantasy has come to play a larger part in 
his experiments, especially those with Spanos, fantasy evocation tends to become 
more nearly an explanatory variable (180, 182, 183). 


Role Theory 


An exposition of book length now supplements Sarbin’s earlier expositions of his 
role theory (163). The current theory of Sarbin & Coe (164) is elaborated as a 
social communication interpretation, one in which a person adopts a role suited to 
the other and behaves accordingly. He may become very deeply engrossed in the 
role (that is, he may show organismic involvement in it) even to the degree that it 
becomes irreversible. Because such organismic involvement is accepted, there is 
little difference to be detected between this position and the state-theories to which 
it is set as alternative. However, it provides a convenient frame of reference and a 
consistent alternative vocabulary for discussing hypnotic phenomena. Unfortun- 
ately, it gives the impression that there is less remaining to be explained than is 
actually the case. 


Psychoanalytic Theory 


The Gill & Brenman (64, 65) interpretation of hypnosis as adaptive regression 
continues to be one focus of discussion. The position has been criticized from a 
nonstate position (154) and from an alternative ego-psychoanalytic standpoint 
(204). It has been supported by Gruenewald, Fromm & Oberlander (76), who find 
experimentally more primary process intrusions in Rorschach responses when 
subjects are hypnotized than when the same subjects are not hypnotized, and by 
Gur (79) in his study of creativity. Lavoie et al (113) have reported that hypnotic 
susceptibility among schizophrenics correlates negatively with primary regression 
and positively with adaptive regression, again using Rorschach signs. Like role- 
theory, psychoanalysis provides a ready-made vocabulary and an imbeddedness 
of hypnosis within a theory of larger scope. 


Multiple-Aspect Theories 


The desire for parsimony is an esthetic feature of most scientific theories, but a 
parsimonious explanation can be overdone when knowledge is uncertain, as, for 
example, if hypnosis is described as only adaptive regression, or nothing but role 
enactment. As an illustration of a theory of more dimensions, we may consider 
Shor’s (174), which has three dimensions or factors: hypnotic role-taking involve- 
ments, trance depth, and archaic involvement. The first of these recognizes the 
merits of Sarbin’s position in its organismic involvement aspect; the second 
equates hypnosis with other altered states of consciousness; and the third derives 
from psychoanalytic theory. Trance depth refers to the fading of generalized 
reality orientation, and archaic involvement is similar to what psychoanalysts call 
repressive transference. 


Neodissociation Theory 


Dissociation theory was prevalent in the 19th century and early in the 20th century 
as a description of what happened in hypnosis, but it went out of favor. I have only 
recently revived it in an altered form that I call neodissociation theory (89-91, 107). 
The theory postulates a hierarchy of control systems operating at any one time in a 
given individual, and sees hypnosis as modifying the hierarchical arrangement of 
these controls, so that some become segregated (dissociated) from others. The 
theory is designated by a new name in order to disclaim some of the features 
commonly attributed to dissociation. For example, it has often been supposed that 
if a dissociation occurs, there should be less interference between simultaneous 
tasks, so that if one of them is performed out of awareness, it should not affect a 
task performed with full awareness. This is not the case; it can be shown experi- 
mentally that when a task is performed out of awareness through hypnotic sug- 
gestion (or as a result of posthypnotic suggestion), there will be more interference 
between the tasks than when both are performed with full awareness (106, 190). 
The reason is simply that we are dealing with multiple cognitive controls operating 
simultaneously or intermittently, and there are added burdens on the control 
systems when a task has to be kept out of awareness while it is being performed; in 
other words, it costs some cognitive effort to deploy attention away from a task that 
is at all difficult to do. 

The neodissociation theory received support from experiments on pain reduc- 
tion within hypnosis when it was found through automatic writing or automatic 
talking that some cognitive system within the person (called metaphorically “the 
hidden observer”) reported awareness of the pain even though neither pain nor 
suffering was reported in the hypnotic analgesic state (107). While the dissociation 
in this case has some resemblance to amnesia, it differs from ordinary posthyp- 
notic amnesia in that the pain reported by “the hidden observer” was deflected 
from full awareness before it had ever become conscious. Once one has been 
alerted to the possibilities uncovered in these experiments, attention becomes 
drawn to a number of observations made over the years, especially by clinicians, 
that bear upon such dissociations (e.g. 167). 

If hypnosis is interpreted as but one method by which to reorder the hierarchies 
among multiple cognitive control systems, the interpretation has some advantage 
over a more general state theory. The neodissociation theory recognizes that 
control systems can be altered in various ways other than by hypnotic procedures. 
Some partial effects found within hypnotic practice are difficult to square with a 
theory requiring a highly pervasive hypnotic state. For example, an arm contrac- 
ture, suggested to persist after termination of hypnosis, may in fact persist after 
hypnosis is formally ended. The neodissociation interpretation permits the con- 
tracture to be referred to a partially dissociated control system affecting the 
muscles of an arm, without involving the whole of the person’s cognitive activities. 
Such an interpretation may very well temper the debates over state theory in that it 
satisfies the state theorist by recognizing some obvious changes in the control 
systems of the subject brought about by the hypnotic procedures, without offering 
an explanation for the specific changes on the basis of some characteristic that 
separates every instance of hypnosis from other alterations of cognitive controls. 


THE HYPNOTIZABLE PERSON 


Large individual differences in hypnotic responsiveness have long been recog- 
nized; more recently many efforts have been made to quantify these differences, to 
determine their stability and modifiability, to find personality and situational 
correlates of them, and to design experiments that take full account of their 
existence. 


Hypnotic Responsiveness Scales 


The scales for measuring hypnotic responsiveness go by various names such as 
depth scales, suggestibility scales, or susceptibility scales, but these names do not 
specify any important differentiating characteristics; each provides a sample of 
hypnotic-like behavior, scored according to the number of items “passed.” Fuller 
reports have appeared on scales that were earlier standardized (8, 86, 121), and a
formal revision of an earlier scale has appeared (209). Scales that have been 
translated into other languages and tested in other cultures, e.g. German (80a) and
Japanese (203a), have shown very similar distributions of hypnotic responsive- 
ness. A newcomer, a short scale standardized on a large clinical population, is that 
of Spiegel, yielding a hypnotic induction profile score of 1 to 5 (not to be confused 
with the earlier Stanford profile scales), and giving prominence to his eye-roll sign 
(188, 189). The characteristics of his scale do not seem much different from the 
other available scales, and his scores correlate positively with them. The Stanford 
Hypnotic Susceptibility Scale, Form A (SHSS-A) has been compared
experimentally with the Barber Suggestibility Scale (BSS), and they have been found to
have much in common, particularly if the subjective correction of the objective 
scores on the BSS has been made (158). 

Among the variants of these scales is a group alert scale that moves away from 
the association of hypnosis with sleep but still correlates with more familiar scales 
(205). 


Supplements to Standardized Scales 


Barber (10) introduced a subjective correction to the objective scores of the BSS, at 
first merely correcting downward those “objectively passed” items that seemed 
unconvincing to the subject. A separate subjective score was later introduced, in 
which the subjective convincingness of each item was scaled whether or not the 
item was objectively passed. With the new subjective scores, the correlation 
between objective and subjective scores that was spuriously high with the old
method (158) fell to .44, showing that the two scores have much less in common 
than originally supposed (185). 

Another supplement to the standardized scales is provided by clinical observa- 
tions and estimates, as used by Orne, Shor, and their associates in defining 
“plateau responsiveness,” that is, when the subject has reached a plateau after 
repeated hypnotic experiences (139, 175, 177). 


Stability of Measured Differences 


Some measure of consistency in human characteristics is congruent with the
possibility of change; the question of relative stability has to be answered
empirically. Some stability is shown by retest reliabilities with alternate forms of
responsiveness scales, commonly in the upper .80s or lower .90s. In a long-term retest
with SHSS-A, subjects tested while they were students in a university were retested
at an average of 10 years later, and yielded a retest correlation of .60, despite
the changes in living and working arrangements and the responsibilities of young 
adult life outside the university (130). 

A second evidence of stability comes from a possible hereditary component in 
hypnotic responsiveness, as derived from a study of twins and their families. 
Monozygous twins, separately tested at the same time, produced scores of the 
SHSS-A correlating .56, while dizygous twins correlated only .18, with their low 
correlations supported by those between other family members. The computed 
index of heritability (h²) of .62 was highly significant, and leaves open the
possibility of some genetic factor (129).
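
As a rough check on how twin correlations of this kind translate into a heritability figure, the sketch below applies Falconer's formula, twice the difference between the monozygous and dizygous correlations, a common textbook approximation. This is an assumption made for illustration: the text does not say how the reported index of .62 was computed, and the function name is hypothetical. For the correlations quoted above the approximation gives about .76 rather than .62, so the reported index evidently rests on a different computation.

```python
def falconer_h2(r_mz: float, r_dz: float) -> float:
    """Falconer's approximation: twice the MZ-DZ correlation difference.

    Illustrative only; not necessarily the index used in the cited study.
    """
    return 2.0 * (r_mz - r_dz)


# Twin correlations on SHSS-A as quoted in the text.
h2 = falconer_h2(0.56, 0.18)
print(round(h2, 2))
```

The gap between the two figures is a reminder that heritability indices depend on the estimator chosen and on whether data from other family members enter the calculation.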

A third line of evidence derives from the continuity between experiences in
early childhood and those of adult life that are related to hypnotizability,
especially forms of imaginative involvement often encouraged in association with
parents; some later influences of childhood discipline are also found (96). In other
words, continuities can be provided by significant social-environmental influences
as well as by hereditary ones.


Modification of Susceptibility


A review of efforts to change hypnotic responsiveness in the laboratory lists some 
175 references, but not over one tenth of these have anything to do with persistent 
changes (45). 

Among the interventions that have been tried are clinical reassurances to relieve 
anxiety and create a more favorable attitude (38, 175); participation in
“consciousness-expanding” or encounter group sessions (170, 197); methods
resembling operant conditioning or behavior modification (161); various kinds of
modeling or imitative behavior (43, 44, 105); sensory deprivation (162, 210);
muscular relaxation induced by EMG feedback (211); special demonstrations and 
communications about hypnosis (44, 75); and requiring close attention to the 
hypnotist’s words by giving electric shocks for evidence of inattention (78). 

The most common finding of the studies has been that the scores show a mean 
gain of a few points over a control group, and that these gains are either fairly 
uniform across the board, or are greater for those with initial scores in the middle 
range. The final scores usually correlate substantially with the initial scores. The 
hope of producing hypnotic virtuosos out of originally low scorers has not been 
fulfilled. 


Personality Correlates of Hypnotic Responsiveness 


The most interesting relationships fall into four areas: imagery, imaginative in- 
volvements (absorption), creativity, and cerebral function (especially hemispheric 
laterality). Cerebral function will be considered later (p. 28). 

Despite some conflicting evidence, the residual result from relating imagery to 
hypnosis seems to be a low but positive relationship (96, 141, 145, 186, 193). 

The relationships between imaginative involvements and hypnosis that turned 
up conspicuously in the clinical studies of J. R. Hilgard (96, 97) have now been 
incorporated in a pencil-and-paper test that correlates positively with hypnotic 
susceptibility. Tellegen & Atkinson (199) added appropriate imaginative in- 
volvement or absorption items to others selected from the MMPI as most repre- 
sentative of the two commonly reported MMPI factors: a stability-neuroticism 
factor and an introversion-extraversion factor. Subsequent factor analysis
revealed three factors, two of them corresponding to the familiar MMPI factors, and
one corresponding to the new absorption scale. Only absorption consistently cor- 
related with hypnotizability in two replications with large samples. 

Creativity, which must bear some relationship to free fantasy, has also produced 
some interesting if baffling relationships to hypnotic responsiveness (Bowers & 
Bowers 28, Bowers 29). Both sex and level of susceptibility have to be taken into 
consideration as moderating variables (Bowers 25). The correlations between 
creativity tests and hypnosis are usually found only for female subjects, and 
correlations vary within subject groups stratified according to levels of hypnotic 
ability. Others have also reported higher correlations for women than for men 
between hypnotic scores and creativity on a variety of tests (147). 

Improving creativity through hypnosis is another matter from finding a
correlation between hypnotizability and creativity scores. Recently R. C. Gur (79) has
been able to show a relationship whereby male subjects selected as high in
hypnotizability score higher in creativity under hypnosis than in the nonhypnotic
condition, particularly in figural creativity on a picture construction test.

It should be noted that many “plausible” relationships between personality 
characteristics and hypnotic susceptibility are not confirmed empirically. Among
these are various components of cognitive styles such as tests of repression-
sensitization, leveling-sharpening, or field dependence and independence. Some
positive correlations commonly emerge when a number of tests are used, but 
sometimes the signs are reversed from one study to another (66, 178). 


PSYCHOPHYSIOLOGICAL CORRELATES OF HYPNOTIC 
RESPONSIVENESS AND OF THE HYPNOTIC CONDITION 


Numerous reviewers have come to the conclusion that no physiological indicator 
has been found to tell when a person is hypnotized (9, 165), but the search
continues. 


EEG Correlates of Hypnotic Responsiveness


A person who is highly responsive to hypnotic procedures tends to have a high 
resting EEG-alpha, in the range of 7 to 13 Hz. That is, not hypnotized, and sitting 
with eyes closed in a semi-darkened room, the more highly susceptible subject is 
more likely to produce alpha waves of higher voltage and for longer overall
duration (7, 122, 134). These studies have been criticized (52, 143), but more recent 
findings continue to support the relationship (131). Some studies, to be sure, show 
technical and statistical problems that make their results difficult to interpret (60, 
202). The intriguing possibility has been put forward, with moderate experimental
support, that hypnotic responsiveness can be enhanced by feedback training in the
production of alpha (50). 

Other electrical measures of brain activity, such as evoked potentials, have
usually given negative results, although one study shows that under conditions of
selective attention more highly hypnotizable subjects increased their sensory
evoked responses more than the less highly susceptible (59).


Laterality of Function


By now it is well known that some specialization of function (not mere lateral
dominance) characterizes the activities of the two cerebral hemispheres in man. 
These findings have led to efforts to relate laterality to hypnotic responsiveness. 
When a subject is asked to solve a mental problem by an experimenter seated
before him and looking him in the eyes, the subject tends to divert his eyes in a 
characteristic manner, either to the right or to the left. These conjugate lateral eye
movements are activated by the opposite hemisphere, so that they can serve in part 
as an indicator of activity in that hemisphere. Bakan (5), the first to note a possible 
relationship to hypnosis, found a significant positive correlation between the
tendency to move the eyes to the left and hypnotic susceptibility, leading to the
conjecture that the highly responsive subject tends to favor the right hemisphere. 
The results were subsequently confirmed, but in one case weakly (131a) and in 
another case somewhat indirectly (80). A subsequent experiment by R. C. Gur and 
R. E. Gur (in preparation) confirmed Bakan’s results strongly through the use of 
appropriate moderating variables (sex, handedness, and eyedness). Coherent with 
other reports, they found indications that right-handed males were more com- 
pletely lateralized than the right-handed females, and conversely, that left- 
handed females were more completely lateralized than left-handed males. The 
consequence was that eye movements predominantly to the right correlated neg- 
atively with hypnosis for right-handed males, and positively for left-handed 
females. These results are coherent with the interpretation of hypnosis as a right 
hemisphere function for those who are strongly right-handed, and a left
hemisphere function for those who are strongly left-handed. Although there was some
relationship between degree of left-handedness and hypnotizability in males, 
Bakan’s (6) finding of a greater variability of hypnotic scores among the left- 
handed was not confirmed. 


Neutral Hypnosis 


The hope of finding a specific physiological correlate of the established hypnotic 
state requires some sort of definition of “neutral hypnosis,” because it is well 
known that the hypnotized subject can do anything from dancing a jig to lying in a
stupor. Defining neutral hypnosis is a dubious procedure because the definition 
itself will bias the findings. Hypnosis cannot be defined as extreme relaxation 
because it can equally well be described as a hyperalert state (119, 205). The 
possibility of finding some physiological sign that will persist against the various 
backgrounds of hypnotic behavior is remote but the search goes on, for example, 
with respect to electromagnetic fields (151), electrodermal responses (48, 148), 
or eye movements distinguishing between the more and less susceptible subjects 
(207). 


Psychophysiological Correlates of Pain Reduction 
or Production within Hypnosis 


The behavioral successes of pain reduction in the laboratory through hypnosis 
have been demonstrated repeatedly (15, 55a, 87, 88, 93, 184). The studies agree 
that the amount of pain reduction is correlated with measured hypnotic respon- 
siveness, usually in the range of .40 to .50. It does not matter very much whether 
the suggestions for pain reduction are given in the nonhypnotic state or after 
hypnotic induction, so that the results are more attributable to individual differ- 
ences in responsiveness to suggestions than to hypnosis as such. The responsive- 
ness to suggestions of the hypnotic type can be distinguished, however, from 
placebo response (123). On the psychophysiological side, the usual finding is that 
the changes in either heart rate or blood pressure tend to be minimally affected by
suggestion, but the blood pressure still responded to the stress of ice water even
though the subject reported no pain whatever. In ischemic pain produced by a 
tourniquet followed by exercise, Lenox (116) reported differential effects as 
between the nonhypnotic condition and hypnotic analgesia, with a leveling of 
both blood pressure and heart rate when the pain was not being felt. Attempts to
replicate his results in the same laboratory (unpublished) have met with only 
partial success.

Pain can also be produced in hypnosis through suggestion, to serve as a control
for the condition in which there is actual physical stress. Results are somewhat 
complex, but they do not generally parallel very closely the changes when the 
actual stress occurs (46, 88). 

The burden of the findings is that hypnotic analgesia does not have very 
profound effects on the usual physiological indicators, a result consonant with the
presence of pain reports by “the hidden observer” in the automatic talking ex- 
periments. The evidence is reviewed in greater detail in Hilgard & Hilgard (94). 


Unilateral Control of Skin Temperature


Voluntary control of temperature differences between the hands has been report- 
ed (125, 155), but the separate effects of feedback and hypnosis still have to be 
clarified. Temperature changes are often noted, for example, in autogenic training
(82a). In one study of heart rate modification in which autogenic training and
feedback were used, some highly hypnotizable subjects appeared to be less
responsive to feedback than less highly hypnotizable ones, possibly because of a 
distorted awareness of their own bodily processes (39). 


SPECIMEN HYPNOTIC PHENOMENA: AMNESIA, 
ILLUSIONS, HALLUCINATIONS, AND DREAMS 


Limitations of space forbid a thorough review of the various hypnotic phenomena 
that are subject to laboratory investigation. To avoid a too telegraphic summary, a 
few phenomena illustrative of cognitive distortion or imaginative production have 
been selected for review.


Amnesia 


Posthypnotic amnesia fits the pattern of dissociation because something which the 
subject once was able to recall (and will recall again) is temporarily inhibited and 
thus is lower in the hierarchy of available cognitive information. The studies have 
been reviewed by Cooper (37). Three issues are prominently under discussion: 1.
the degree to which amnesia is spontaneous following hypnosis, with the
indications that it is rarely present unless it is suggested or expected by the subject (36,
92); 2. source amnesia, in which the content that is learned under hypnosis is
recalled although the subject has forgotten that it was learned under hypnosis (36,
51, 62); and 3. the relative importance of hypnotic induction before response to
amnesia suggestions (14, 185, 200). Induction turns out to be less important than
might be expected.

Cooper (37) showed that the bimodality in distribution of posthypnotic amnesia
is attributable to two underlying distributions: (a) ordinary forgetting of incidental 
learning, not recoverable; (b) truly suggested amnesia, in which the forgotten 
material is remembered again when a release signal is given. As a technical matter,
in scoring hypnotic scales recoverable amnesia should be used in a refined scoring 
of susceptibility to amnesia, a point also made by others (104, 133). The bimodality 
of amnesia scores occurs at all levels of scoring of hypnotic susceptibility (104); 
children’s amnesia scores are heavily influenced by ordinary forgetting. 

More highly hypnotizable subjects, when reporting a few items remembered 
from among a larger number because of amnesia suggestions, tend to report these 
items in a random order, while less susceptible subjects, reporting the same 
number of items, tend to give them in the order in which they occurred (54). It 
appears that amnesia suggestions in the more hypnotizable disrupt ordinary re- 
trieval processes. 

Memories dissociated in amnesia still interact with other cognitive processes, as 
neodissociation theory would predict. For example, interpolated learning in a 
retroactive paradigm interferes with the recall of prior learned material even 
though the subject is amnesic for the interpolated material (74).


Visual Illusions and Hallucinations 


Parrish, Lundy & Leibowitz (142) showed that in hypnotically induced age re- 
gression two illusions changed in their frequency of effects in accordance with 
known age changes. Unfortunately, efforts to replicate the finding have been 
unsuccessful (4, 146). Ablating the background through negative hallucination 
failed to cancel the Ponzo illusion (128). A comparable finding was earlier re- 
ported for delayed auditory feedback; a more recent experiment has again 
reported that speech disruption through such feedback continues even though the 
subject is hypnotically deaf to his own voice (166). 

These largely negative results are countered by one positive finding with visual 
hallucinations. Graham (71), using a conditioned hallucination paradigm, found 
that the positive hallucination of a dark background on a white card produced the 
contrast hallucination, a gray circle appearing lighter against this background than 
against a white background. Simulating subjects failed to produce the effect. 

Eye movements to the hallucination of a moving object appear grossly similar to 
the movements that would occur if the object were physically there and moving 
(30, 55), but more exact measures show that the movements differ in detail from 
those in the presence of the physical object, whether it be the nystagmus move- 
ments of following horizontal movements of separated bars, or the regular pursuit 
movements in watching a swinging pendulum (72, 73, 117). 

Not classifying strictly as a study of either illusion or hallucination, but related 
in that there is a modification of visual stimulus processing, is a report by Graham 
& Leibowitz (70) on the modification of visual acuity. Myopic subjects improved 
their visual acuity both in the laboratory and in the outside world following 
hypnotic suggestions. The puzzling finding was that the change in acuity took 
place without any physical change in eye refraction as measured through a laser 
technique that permitted simultaneous measurement of refraction and acuity. It
appears that whatever change in acuity took place must have been owing to some 
change in processing of retinal information at higher levels. 


Sleep and Dreams 


Although hypnosis does not show the EEG characteristics of sleep, the interaction 
between sleep and hypnosis continues to be of interest. Kratochvil & Macdonald 
(109) appeared to show that a hypnotic condition induced prior to sleep could 
continue after genuine sleep intervened, but the possibility of a rapid 
reinstatement cannot be ruled out. 

Some subjects can respond to suggestions while EEG criteria indicate that they 
are asleep. For example, a suggestion such as “Whenever I say the word ‘pillow’ 
your pillow will feel uncomfortable and you will move it” can be given in stage
REM sleep, with the appropriate response elicited during a later REM period (53). 

Dreams within hypnosis and night dreams may have some modal differences, 
but their phenomenal contents clearly overlap (95, 196). The hypnotic dreams of 
the more highly susceptible subjects are more like their night dreams (196). Tart
has reviewed the literature on hypnotic dreams and on attempts to control night 
dreams through hypnosis (194, 195). Books by Moss (132) and Sacerdote (159)
consider hypnotic dreams in wider contexts. 

An interesting possibility was opened up by Arkin, Hastey & Reiser (3) when
they employed posthypnotic suggestion to produce sleep-talking while the subject 
was dreaming, providing a description of dream content while the dream was 
occurring. The method is apparently limited to those who have a history of 
sleep-talking, but it is still fascinating (2). 


CLINICAL APPLICATIONS 


A number of books have appeared on clinical approaches within the period of this
review, but apart from the large book by Granone (69) in Italian, with a Spanish
translation, there is no general survey of the clinical literature. Mention may be
made, however, of books designed largely for the practitioner (32, 84, 203).
Collections of the clinical papers of Milton Erickson have appeared in two
volumes (81, 82), and a summary of his methods is available in a journal article (17).
Because of Erickson’s ingenuity and inventiveness, his techniques are worth 
studying for the manner in which they free the reader from stereotyped ideas 
about hypnotic procedures. 


Relation to Behavior Modification 


Because hypnosis and behavior modification are both psychological, they readily
suggest comparison. Several of the reports show that they may be combined in the
practice of the individual investigator (42, 115, 208). Many unsolved problems
remain, with answers lacking on how to decide which is the method of choice for a
given person with a given complaint. 


Relation to Acupuncture


The current interest in acupuncture has led to numerous discussions of the rela- 
tionship between hypnotic pain reduction and pain reduction by way of inserted 
needles. The successes of acupuncture in major surgery can be matched by equal 
successes of operations performed with the help of hypnosis. Pain reduction in 
animals, claimed for acupuncture, has not been demonstrated with hypnosis, 
though forms of animal hypnosis (animal catalepsy) have long been studied (150). 
The research evidence is not yet firm, but it appears that there are elements of 
suggestion in acupuncture processes, in addition to whatever physiological effects 
the needles or electrical stimulation may add to the control of pain (110, 173, 
206a). 


Symptom Control in Clinical Practice 


The relation between smoking and lung cancer has led many habitual smokers 
to seek to break the habit, and many have sought the help of hypnosis (102). 
Treatment varies from a number of sessions to a single session. Success in a single 
session has been reported by Spiegel (187), who begins with his hypnotic induction 
profile (189) and then has the subject commit himself to a standard set of
self-suggestions, and to reinforce his resolve to give up smoking by an inconspicuous
self-hypnotic procedure that can be practiced in public. He finds from records of 
hundreds of cases that the more hypnotically responsive the subject is, the more 
likely he is to give up smoking in the short run, but after a year only about 20% will 
still not be smoking, and this residual group does not appear to be weighted with 
the more highly hypnotizable. It is commendable to have his reports on a large
number of cases, with sufficient follow-up so that both the successes and failures 
can be known. 

Migraine headache is a debilitating symptom normally recalcitrant to treat- 
ment, yet it has been successfully treated in a large number of cases through 
hypnosis, especially by Harding (83). Careful follow-ups of 90 cases showed 34 
with complete relief for periods up to 8 years, another 29 with substantial reduc- 
tion in frequency, duration, or intensity of the migraine periods, and the remain- 
ing 27 either failures or lost to follow-up. 

Another troubling set of symptoms arises when a limb has been amputated and
the patient has a “phantom limb” replacing the lost member, often registering 
pain. A series of 37 of these cases have been reported as treated hypnotically by 
Cedercreutz & Uusitalo (31) with considerable success, 20 of 37 being symptom- 
free after treatment, with 10 improved. A year later 8 remained symptom-free and 
10 improved. 
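
The follow-up counts in the migraine and phantom-limb series above are easier to compare as proportions. The short sketch below only restates the reported counts as percentages; the function and variable names are illustrative, and taking all 37 original cases as the base for the one-year phantom-limb figures is an assumption, since the text does not state the denominator at follow-up.

```python
def percentages(counts: dict[str, int], total: int) -> dict[str, float]:
    """Convert outcome counts to percentages of the case total, to one decimal."""
    return {label: round(100.0 * n / total, 1) for label, n in counts.items()}


# Migraine series (83): 90 cases followed up.
migraine = percentages(
    {"complete relief": 34, "substantial reduction": 29, "failure or lost": 27}, 90
)

# Phantom-limb series (31): 37 cases, right after treatment and a year later.
# Assumption: the one-year counts are taken against the original 37 cases.
phantom_after = percentages({"symptom-free": 20, "improved": 10}, 37)
phantom_year = percentages({"symptom-free": 8, "improved": 10}, 37)

for name, result in [
    ("migraine", migraine),
    ("phantom limb, after treatment", phantom_after),
    ("phantom limb, one year", phantom_year),
]:
    print(name, result)
```

On these figures roughly 70% of the migraine cases showed complete relief or substantial reduction, while the share of phantom-limb patients who were symptom-free fell from about 54% after treatment to about 22% a year later.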

There is a large literature on the hypnotic control of other pains in surgery and 
in the clinic, pain from labor in childbirth, broken bones, dental extractions, back 
pains, burns, terminal cancer. The reviews by Barber (11, 12), because they 
emphasize limiting conditions, give the erroneous impression that there are few 
clinical successes and contradict his own success in the relief of laboratory pain. 
Other reviews of hypnosis in the clinic give evidence of success, e.g. Scott (169),
Crasilneck & Hall (40), and Marmer (124). 


Psychosomatic Problems 


Hypnosis has been tried with some success in a number of psychosomatic com- 
plaints (101, 144, 149). 

Warts have long been a topic for speculation (and superstition). Success in 
removing warts by suggestion has been reported (35, 192), but there have also been 
reported failures (199a). Confusing findings are also found for other skin
eruptions. Purpuric lesions (skin spots owing to slight hemorrhages) were produced as a
result of suggested emotions (1). Black (20) reported success in altering an atopic 
skin inflammation, but an attempted replication failed (18). It appears that sug- 
gestion may succeed in subjects who meet two criteria: (a) they have a history of 
psychosomatic reactions in the system being studied; and (b) they are highly 
responsive to hypnotic suggestions. Negative results apparently occur when these 
conditions are not met; whether the conditions are sufficient to guarantee positive 
results is not known. 


Various Symptoms


To give some idea of the variety of uses that have been made of hypnosis in clinical 
practice, an arbitrary selection of topics is listed here with a single specimen 
citation of each: hiccups (157), insomnia (22), phobias (99), psoriasis (56), seizure 
states [adult (120), child (61)], heart and stress (213), stuttering (126). 


Patients Showing Psychotic Behavior 


Psychotherapy of patients with mild behavior disorders (“neurotic symptoms”) is 
commonplace in hypnotic practice, but it has often been assumed that psychotics 
could not be hypnotized. Recent evidence shows that schizophrenic patients, if 
they are in contact at all, show much the same distribution of hypnotic respon- 
siveness as other subjects (114, 206). They are also candidates for psychotherapy 
by way of hypnotic procedures (19). 


HYPNOSIS AS A TOOL IN PSYCHOLOGICAL 
INVESTIGATIONS NOT CONCERNED PRIMARILY 
WITH HYPNOSIS 


Any orderly phenomena become interesting in themselves but also have the 
potential of usefulness in experiments whose substantive interest lies elsewhere. 
Hypnosis has been used to control conditions in other experiments in a variety of 
contexts (21, 68, 118, 152). 


Learning, Memory, and Skilled Performances 


What can we learn about processes of acquisition, retention, and retrieval that are 
of general psychological significance by using hypnosis to control attention,
emotional arousal, or other background aspects of learning and memory? The
experiments of Blum and his associates provide good illustrations, recently summarized
by him (21). Using a few highly trained subjects he is able to control anxiety level, 
cognitive arousal, or pleasure, and determine the consequences for learning of 
these conditions as parameters of learning and performance. 


Social Motivation and Behavior 


An ingenious earlier study by Rosenberg (156) showed how hypnosis could be 
used to control attitudes to then current social problems and result in a cognitive 
reorganization according to a theory not itself based on hypnosis. Later, Zimbar- 
do and his associates (214) tested the theory of cognitive dissonance by studying 
changes in the interference with a learned performance by evoked pain. A ma- 
nipulation by cognitive dissonance reduced the amount of interference by the 
pain, and could be compared with the pain reduction when the interference was 
reduced by hypnosis. In a related experiment, hypnosis was used instead of 
epinephrine to produce an aroused state to test the effect of the social situation on 
perceived emotion (215). 


Psychopathology and Psychotherapy 


According to Levitt & Chapman (118), the most frequent use of hypnosis as a 
research supplement has been in the artificial induction of emotional and psy- 
chopathological states in order to allow investigation of their underlying dynamics 
(e.g. 171, 179, 191). 


Monitoring and Controlling Subjective States 
and Their Physiological Correlates 


Hypnosis permits the maintenance of a subject’s state of arousal at a remarkably 
uniform level, if that is desired. It may be that the successes in unilateral control of
hand temperature (125, 159), in which hypnosis is combined with biofeedback, 
have relied on a kind of “verbal curare” to maintain some of the uniformities that 
Miller and his associates have produced by curarizing their rats in order to 
condition such unilateral vasomotor responses (127). 


Cautions in the Use of Hypnosis 


The hazards of hypnosis to the subject are minimal (98), but the experimenter may 
risk experimental failure by expecting too much from hypnosis. While important 
effects can be produced through hypnotic-like suggestions, the effects have to be 
studied for themselves in pilot studies and cannot be assumed to have the status of 
adequate controls until they have been tested; fortunately, many of them will pass 
the test. 


CONTEMPORARY HISTORY 


The years since the earlier review have witnessed a substantial increase in hypnotic 
interest and activity, including the establishment of Division 30 within the APA, 
with a membership of 433 in 1973, and a revitalized International Society of
Hypnosis, bringing together those from around the world who are interested in the 
scientific and clinical aspects of hypnosis. 

The boundaries of hypnosis, especially in its clinical applications, are confused 
by two related developments. The first is autogenic training, a somewhat ritualized 
relaxation technique developed by Schultz in Germany, now widely used 
throughout the world, especially in Germany and Japan (168). Its practitioners 
recognize a close relationship to hypnosis (especially autohypnosis) and com- 
monly take leadership in hypnotic societies (Gheorghiu, Langen & Velden 63). 
The second is sophrology, a form of psychotherapy founded by Caycedo in Spain 
and Gubel in Argentina, and now widely adopted throughout Mexico and South 
America. Although some of its adherents distinguish between it and hypnosis (on 
the grounds that it is somewhat broader) others seem to make no distinction (77). 


Books Offering Major Reviews of the Research Literature 


Several recent books in English provide a wide coverage of the experimental 
literature (10, 33, 58, 67, 86, 164). Three books provide collections of reports from 
international congresses or other international seminars (34, 111, 112). Major 
books continue to appear in other languages (33, 69, 103, 108). More specialized 
books have been referred to under appropriate topics. 


Historical Reviews 


A number of original documents important in the early history of mesmerism and 
hypnosis have been republished in the collections edited by Shor & Orne (176) and 
Tinterow (201). Ellenberger’s book (49) deserves special mention because from its 
title it would not be inferred that it is the best history of hypnosis now available. 


A LOOK TO THE FUTURE 


Hull (100), ready to leave the field after his book was written, bequeathed 102 
experimentable problems for the future investigator. Many of his questions 
remain unanswered. Fromm’s (57) anticipations for future research will be
interesting to review after more years have passed. The gratifying advances in
experimental studies need to be supplemented by multivariate approaches to bring order
out of the countless variables with minor effects. The anecdotal clinical literature, 
valuable as it is, needs more systematic studies to determine who can profit by 
what kind of treatment, with attention to measurable individual differences as well 
as the influences of the patient’s personal history. 

In recent years, public concern about nonmedical drug use has stimulated a
generous allocation of research funds, and this has resulted in a very large body of 
literature over a wide range of disciplines. The present brief review is necessarily 
highly selective, both in terms of the areas covered and the literature cited within 
specific fields. Some areas have been arbitrarily excluded because of the limits of 
space; others such as pharmacology and animal experimentation have been 
omitted because of the author’s lack of familiarity with the literature. Alcohol and 
tobacco are not considered, although the aggregate individual and social costs
resulting from their use are clearly much larger than those for other drugs. I have
attempted to anticipate the topics of general reader interest—emphasizing such 
areas as current trends in the heroin epidemic, the individual and social impact of 
marijuana use, evaluations and new developments in treatment, and the
effectiveness of prevention efforts. Special attention has been given to the
epidemiological and behavioral research that describes the dimensions, significance, and
trends of current nonmedical drug use. 

Probably the best general reviews of nonmedical drug use are contained in the 
recent reports of the U.S. and Canadian drug commissions. Both commissions 
issued separate reports on marijuana (23, 80), followed by documents on the 
overall subject of drug abuse and social policy (24, 79). Reviews of current 
marijuana research are provided by the annual Marihuana and Health reports to 
Congress (90). The Federal Strategy for Drug Abuse and Drug Traffic Prevention 
reports (1973 and 1974) describe the overall Federal activities and goals, and are a
good source of current statistics related to drug use, treatment, and prevention (99, 
100). 


1 Preparation of this chapter was supported by Research Scientist Award No.
K05-DA-70182 from the National Institute of Mental Health.
THE RECENT EPIDEMIC


Before examining the current prevalence and patterns of drug use, it is of interest 
to note some of the unique characteristics of the recent drug epidemic among 
middle class youth. Its initiation in the early 1960s was largely unrelated to the 
then existing population of users among lower class minority groups. Only opiates, 
marijuana, and cocaine were proscribed at that time, and the limited nonmedical 
use of amphetamines and barbiturates was not viewed as a significant problem. 
Hallucinogen use was almost entirely limited to the ritual consumption of peyote 
by the Indians. The introduction of marijuana and hallucinogens to the middle 
class was closely related to the hippie movement and the popularization of LSD 
which preceded this phenomenon (70). LSD was probably crucial to the formation 
of the hippie or psychedelic movement, but the ideological and life-style features 
extended far beyond the associated drug use. Its spread was facilitated by the mass 
media, popular music, and the introduction of the underground press. In the 
opinion of the reviewer, the hippie movement of the 1960s was quite a significant
social phenomenon. While it did not fully involve more than a small proportion of 
the total youth population, it influenced the philosophy, music, dress, attitudes, 
values, and drug-using behavior of a much larger group (69). Much of the wide- 
spread adoption of marijuana and other drug use can be viewed as marginal 
participation of the masses in the styles set by the hippie movement. 

With the popularizing of the hippie movement, the original significance of LSD 
and other drugs was largely dissipated, and ideological factors are not regarded as 
a major motivation for use at present. Nevertheless, the explanation for many 
aspects of current drug usage can be traced to the manner of introduction into the 
culture. First, usage continues to be heavily concentrated among the young—that 
segment of the population most susceptible to fads and styles. Second, whereas 
illicit drug use had been largely limited to lower class minority groups in the past, 
the new phenomena were predominantly among middle class whites. For obvious
reasons, the former groups were not responsive to a movement advocating drop- 
ping out of an overmaterialistic society. Third, normal curiosity and rebelliousness 
are more often factors in initiation than are personality defects or poor family 
backgrounds. Also, the ratio of female to male users is higher than for previous 
illicit drug usage, probably because of the higher acceptability of the behavior 
within the peer group. Fourth, the predominant pattern of infrequent usage of 
small quantities reflects the fact that most individuals are participating in a fad or 
style rather than being primarily attracted by the pharmacological properties of 
the drug. Finally, the rapid spread of drug use among secondary and college 
students throughout most countries of the world is clearly a copying of a style 
initiated in the United States. Some of these countries such as India have had 
lower class cannabis-using populations for centuries, but current student usage is 
an adoption of Western styles rather than a diffusion from indigenous sources. 
With the peak of the hippie movement now several years past, some of the above 
relationships are beginning to become less relevant; however, they still need to be 
considered in interpreting the present status of nonmedical drug use. 
PREVALENCE AND TRENDS


Opiates


The available data permit only a rough estimate of the number of narcotics 
addicts currently in the United States. Since the population is not sufficiently 
accessible to household and student surveys (16), various other methods have been 
employed. Until recently the only estimates were based on case registers. The 
Bureau of Narcotics and Dangerous Drugs maintained a register based on state 
and federal arrest reports, and a New York City register included both arrest and 
health agency data (5). Attempts to estimate the proportion of the overall popu- 
lation contained in registers have been based on the percentage of narcotics deaths 
(5, 85) or new arrests (39) which are contained in the registers. Other estimates 
have been based on the incidence of reported serum hepatitis cases (a disease
often spread through unsterile self-injection procedures) and a compilation of 
estimates from state health agencies (20). The estimated addict population 
adopted by the Federal Strategy report in 1973 was 500 to 600 thousand (99). In 
1974, DuPont (30) estimated a population of 630,000 in the following categories: 
treatment—130,000; incarcerated—100,000; currently abstinent—200,000; and ac- 
tive addicts—200,000. This points up another difficulty in terms of the definition of 
an addict—whether persons temporarily abstinent for various reasons are to be 
included in the estimated addict population. There is also apparently a large 
number of persons who have used heroin to some extent but have not become 
addicted. In the 1972 household survey conducted for the U.S. Marihuana Com- 
mission, 0.6% of those age 12-17 and 1.3% of those 18 and over reported one or 
more experiences with heroin (approximately 1.8 million) (79). As mentioned 
earlier, such surveys typically miss the majority of the street addict population. 
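
The definitional difficulty noted above can be made concrete with a small tally. The sketch below uses only the four category counts quoted from DuPont (30); the function name and the choice of which categories to exclude are illustrative assumptions, not part of the original estimate.

```python
# Category counts from DuPont's 1974 estimate, as quoted in the text.
dupont_1974 = {
    "in treatment": 130_000,
    "incarcerated": 100_000,
    "currently abstinent": 200_000,
    "active addicts": 200_000,
}


def addict_total(categories, exclude=()):
    """Sum the category counts, skipping any category named in `exclude`."""
    return sum(count for name, count in categories.items() if name not in exclude)


# Broad definition: every category counts as part of the addict population.
total = addict_total(dupont_1974)

# Narrower definition (hypothetical): drop the temporarily abstinent.
without_abstinent = addict_total(dupont_1974, exclude=("currently abstinent",))

print(total, without_abstinent)
```

Under the broad definition the categories sum to the 630,000 cited; dropping the currently abstinent group alone removes nearly a third of the estimate, which is why the choice of definition matters.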
Of more importance than the crude prevalence estimates are the data on 
incidence and trends that have recently become available. With the large number 
of heroin addicts currently in treatment, it is possible to determine in retrospect 
that a large heroin epidemic began in the mid-1960s and peaked around 1967-69. 
Data from New York show the incidence of first use peaked in 1967 and then 
declined sharply (43). These findings were replicated for persons entering
treatment for each year from 1969 to 1973, so the results are apparently not an artifact
of the interval between initiation of use and treatment entry. DuPont & Greene 
(31) found a similar peak in 1969 for incidence of first use in Washington, and 
Newmeyer (82) reports the same results for San Francisco. Evidence also points to 
a decline in the number of active addicts. Death rates attributed to narcotics have 
declined, as have the number of cases of hepatitis (31, 43). In Washington, D.C., 
positive urine tests among arrestees in 1973 showed a significant decline over that 
for 1972 (31). Finally, the recent decrease in property-related crime has been 
related to the downturn in the prevalence of active heroin addiction (31, 43). 
While it now seems reasonably well established that an epidemic in the initia- 
tion of heroin addiction occurred in the late 1960s, and that the indices of active 
addiction have also now declined, the reasons for these events are less clear. Some 
of the increase in heroin addiction was clearly a spillover from the middle class 
marijuana-hallucinogen epidemic discussed in the previous section. Sheppard,
Gay & Smith (91) report that 40% of those addicted to heroin since 1967 in the 
Haight-Ashbury drug subculture of San Francisco were heavy users of psychedelic 
drugs prior to using heroin. However, there was also a marked parallel increase in 
the incidence of addiction among the lower class minority groups, which have 
traditionally been involved in heroin use. In what manner this may have been 
related to the middle class epidemic is unknown. It peaked considerably sooner 
than the incidence of marijuana and hallucinogen use among the middle class (see 
Table 1), although it did parallel the vital phase of the hippie movement (1966-69) 
fairly closely. 

The large increase in the availability of addiction treatment facilities in the past 
few years is undoubtedly the source of some of the decline in the indices of current 
active addiction. The number in methadone maintenance treatment alone has 
increased from 4000 in 1969 (109) to approximately 80,000 in 1974 (30). Another 
factor is the success of law enforcement in limiting the availability of heroin. 
Substantial increases in the retail price of heroin have resulted, with an accom- 
panying decline in the average amount consumed (13, 31). The decline in the 
indices of addiction prevalence may also be related to the natural decline of an 
epidemic. DuPont & Greene (31), as well as others, have noted the development of 
an anti-heroin attitude within the community. 


Marijuana 


Whereas there have been no satisfactory surveys of narcotics addiction in the 
United States, there is an abundance of surveys of marijuana and other drug use, 
mostly among student populations. The Marihuana Commission compiled a file 
of over 200 such surveys, involving more than 900,000 students (79). Table 1 
presents survey results for the period 1967-73. The Gallup national poll of college
students indicates the prevalence of any use of marijuana increased from 5% in 
1967 to 51% in 1971, and the 1971 and 1972 Marihuana Commission surveys found 
approximately 15% of both youth (age 12-17) and adults had used the drug. The 
annual high school survey of San Mateo County, California, is of particular 
interest, since it has employed a standard instrument over the past 7 years and 
bases its statistics on use during the preceding year rather than in terms of ever 
having used. The data in Table 1 are based on responses from approximately 
5000 high school seniors per year. San Mateo County is adjacent to San Francisco, 
and thus had an earlier and more pronounced exposure to the hippie movement 
than did most other areas. The drug usage rates are substantially higher than for 
most other parts of the country, and probably serve as good indicators of the max- 
imum level of acceptance for the various drugs. It will be noted that marijuana, 
which is widely regarded as a relatively harmless drug, has continued to increase 
in popularity, whereas use of the other drugs has been essentially stable over the 
past few years. 


Hallucinogens 


The opinion is often expressed that the use of LSD and other hallucinogens has 
declined sharply over the past few years. Certainly the aggressive propagandizing 
of hallucinogen use has largely disappeared, along with the accompanying mass
media publicity. Also, the incidence of hospitalizations for bad trips is markedly 
reduced. However, the available survey data do not show a corresponding
decline in use. The rate of LSD use for San Mateo County high school seniors 
(Table 1) has been stable at about 17% for the past 6 years. Because LSD has often 
been sold under the name of mescaline in recent years, the actual usage of LSD 
may have increased.²


Stimulants and Depressants 


Only a portion of nonmedical stimulant and sedative use is detectable through 
household and student surveys. The intravenous methamphetamine (speed) using 
population is generally not accessible, and the same is true for sedative and 
stimulant use by narcotic addicts, alcoholics, and other members of the street drug 
subculture. One Canadian study of a large sample of hospitalized alcoholics found 
10% were barbiturate abusers and one-third of these either were or had been 
addicted (28). In addition, two studies of heroin addicts admitted to the Lexington 
hospital in the mid-1960s found the incidence of barbiturate-sedative addiction to 
be 23% and 35% (17, 42). In 1971 Eckerman et al (32) examined urine specimens 
collected from random samples of arrestees in six major U.S. cities. The percent- 
age positive for barbiturates ranged from 1% in New York City to 14% for 
nondrug arrests in Los Angeles. In the three cities for which amphetamine 
screening was performed, the percentage of positive returns was approximately 
3%. 
While there are no reliable estimates of the prevalence of intravenous meth- 
amphetamine use, there is considerable evidence that usage is down from the peak 
level reached in the late 1960s. Illicit manufacture has apparently declined, and 
monthly monitoring of the illicit market by the Bureau of Narcotics and Danger- 
ous Drugs reveals that in most cities the drug is no longer available in large lots 
(67). Apparently, the high-dose intravenous use observed in the late 1960s has 
proved to be self-limiting, and most of the concentrated enclaves of users have 
disappeared (95). 

Recent reports by the U.S. and Canadian drug commissions warned of an 
increasing prevalence in nonmedical use of barbiturates and other sedatives (24, 
79). In particular, the nonbarbiturate sedative methaqualone has become popular 
on the illicit market (37). This reported increase in sedative use is not reflected in 
the available survey data (Table 1), but may exist among users not accessible to
this approach. Certain locales, such as Southern California, are known to have a 
heavy concentration of sedative abusers. The U.S. Drug Commission also called 


2 LSD, phencyclidine (PCP), and MDA (3,4-methylenedioxyamphetamine) are the only
hallucinogens available on the illicit market with any regularity at the present time. Of those 
hallucinogens analyzed by the Federal Bureau of Narcotics and Dangerous Drugs in fiscal 
year 1971, 80% were LSD and another 10% were LSD and phencyclidine in combination 
(67). However, illicit hallucinogens are now so frequently alleged to be mescaline that some 
recent student surveys show the use of mescaline or other non-LSD hallucinogens to be 
reported more frequently than LSD (9). 
attention to the overuse of barbiturates via prescription, calling it “America’s
hidden drug problem, comparable perhaps to the scope of hidden opiate depen- 
dence around the turn of the century” (79). In addition, the Federal government 
recently acted to sharply curtail the availability of prescription amphetamines, 
about 85% of which had been used for weight control (67). On the other hand, the 
authors of the only systematic national survey (1970-71) on the medical use of 
psychotropic drugs arrived at essentially the opposite conclusion—that usage was 
comparable to that for other Western industrialized nations, with only minimal 
excessive use (83). 

Cocaine has recently reappeared on the illicit market after an absence of many
years. Since this is a relatively new phenomenon, there is little data on incidence 
and prevalence (107). The 1972 U.S. Drug Commission survey did find 1.5% of 
youth and 3.2% of adults reporting some usage—more than double the corre- 
sponding percentages for any use of heroin (79). 

While alcohol use is not included in this review, it should be noted that the rate 
of increase in illicit use of alcohol by minors during the past 2 or 3 years has 
probably exceeded that for marijuana or any of the other drugs discussed here (11,
24, 79).


PATTERNS OF USE: FREQUENCY, AMOUNT, 
AND DURATION 


Opiates 


The available survey data indicate that the number of persons who have had 
some experience with heroin is probably three to four times the current addict 
population (79). Some portion of these occasional users will eventually become 
addicted, but it does appear that addiction is not as inevitable as was previously 
believed. One of the probable reasons for the lower addiction potential among 
today’s users is the typical pattern of multidrug use—heroin may simply be one of 
several drugs employed on an intermittent basis. Another likely factor is the small 
amounts of heroin currently consumed. DuPont & Greene (31) cite data for 
Washington, D.C. showing the average heroin content of the consumer package 
dropping from 7% in 1970 to 2% in 1973. In 1973, the average amount of heroin per 
package was only 5 mg. At this level, the typical addict would consume only about 
20 mg of heroin per day in comparison to around 60 mg a few years earlier, or a 
maintenance dose of 135 mg for an English addict in 1969 (98). As a result of the 
low quality and high prices, most treatment programs report that current addicts 
evidence only minimal withdrawal signs when deprived of the drug. 

One of the most significant recent studies in drug addiction concerns the 
subsequent drug-using behavior of U.S. military personnel addicted to opiates in 
Vietnam (86). Of a random sample of 451 army enlisted men who were returned to 
the United States in September 1971, 44% indicated they had used opiates in 
Vietnam and 20% felt they were addicted. Eight to 12 months after their return, 
10% reported some use of narcotics, 2% indicated current use, and 0.7% addiction. 
Urinalysis tests showed 0.7% positive for narcotics. Only 5% of the sample received
treatment after return to the United States, and almost all of this was brief and 
occurred prior to discharge. This natural experiment demonstrates that a drug- 
using behavior which had been considered highly resistive to change may be 
strongly dependent on environmental conditions. In Vietnam, pure heroin was 
available for $2.50 per 250 mg (43); the mode of administration (heroin smoking) 
was one not generally associated with addiction, and those affected were subjected 
to the stress, boredom, and low morale associated with restricted environmental 
conditions and combat in an unpopular war. Upon return to the United States, 
continued use of regular narcotics required the expenditure of large sums of 
money for low-quality heroin, which in turn required intravenous administration 
to produce significant effects; family and associates were disapproving of narcotics 
use; and the environmental conditions and opportunities for nondrug activities 
were greatly improved. 


Marijuana 


In assessing the significance of current marijuana usage in the United States, it is 
instructive to compare the patterns of use with those for cultures with a long 
history of cannabis use. First, with respect to frequency of use, the ratio of daily 
to occasional cannabis use is very much higher in those cultures than that currently 
observed in the United States. The Indian Hemp Commission (47) concluded that 
the more potent preparations (ganja and charas) were seldom used except as a 
daily habit; and a recent study of the extensive use in Jamaica estimated 40 to 50 
percent of the male population used on a daily basis (Marihuana and Health, 1972) 
(90). Other studies in North Africa have estimated that daily use represents 
one-third or less of the total population of users (87, 96). In any event, less-than- 
daily use in these cultures is generally ignored as having little significance. In 
contrast, the 1971 Marihuana Commission survey (1), which found 15% of the 
population had used marijuana, reported only 0.6% of youth and 0.5% of adults 
used marijuana one or more times per day. However, the more frequent patterns 
of use are apparently increasing. The 1972 Marihuana Commission survey found 
little change in overall prevalence of use, but the rate of daily or more frequent use 
more than doubled: 1.3% for youth and 1.4% for adults. Similarly, the 1971 results 
for one or more uses per week were 2.5% for youth and 1.9% for adults, compared
to 4.0% and 4.1% respectively in 1972. 
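
The size of these year-to-year shifts is easier to judge as ratios. The following is a minimal sketch that uses only the percentages quoted above from the 1971 and 1972 Marihuana Commission surveys; the dictionary layout and the helper name are illustrative assumptions.

```python
# Percent of respondents reporting each frequency of use, (1971, 1972),
# as quoted in the text from the Marihuana Commission surveys.
survey_rates = {
    ("daily", "youth"): (0.6, 1.3),
    ("daily", "adults"): (0.5, 1.4),
    ("weekly", "youth"): (2.5, 4.0),
    ("weekly", "adults"): (1.9, 4.1),
}


def fold_change(before, after):
    """Ratio of the later rate to the earlier rate."""
    return after / before


changes = {key: fold_change(*pair) for key, pair in survey_rates.items()}

for (frequency, group), ratio in changes.items():
    print(f"{frequency:6s} {group:6s} {ratio:.2f}x")
```

On these figures the daily-use rates rise by factors of roughly 2.2 (youth) and 2.8 (adults), and the weekly rates by roughly 1.6 and 2.2. The ratios say nothing about sampling error, which the text does not report.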

The same situation exists with regard to the amount of drug consumed. The 
typical occasional user in the United States smokes about one 0.5 gram marijuana
cigarette per occasion (68). Daily users average around three cigarettes per day 
with a maximum of about 10. Since most marijuana consumed in the United 
States averages no more than 1% of the active ingredient, tetrahydrocannabinol
(THC),³ the casual user is estimated to consume about 5 mg THC per occasion.


3 Analyses of 40 large seizures (100-2000 pounds each) made by U.S. Customs at the 
Mexican border in 1971 showed only 25% exceeded 1% in THC content (range 0.07 to 2.87 
percent) (14). 
Daily users average around 15 mg per day, with very heavy users taking 50 mg. By
comparison, moderate daily users in Egypt, Morocco, and India are estimated to 
consume around 40-60 mg THC per day, with heavy users averaging about 200 
mg (69). A recent study in Jamaica estimated heavy usage at around 400 mg THC 
per day (81), and 100 mg experimental doses are routinely administered to heavy 
hashish-using subjects in Greece (75). One ad libitum experiment in Greece found 
chronic hashish users averaged 3 to 7 grams of hashish (est. 150-350 mg THC) per 
day for 30 days (76). 
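
The dose figures in the two preceding paragraphs follow from simple arithmetic on cigarette weight and THC content. The sketch below reproduces them as an illustration; the function names are hypothetical, and the 5% hashish potency is not stated in the text but is what the quoted estimate (3 to 7 grams, est. 150-350 mg THC) implies.

```python
def thc_mg(material_grams, thc_fraction):
    """Milligrams of THC contained in a given mass of material."""
    return material_grams * 1000 * thc_fraction


def implied_fraction(thc_milligrams, material_grams):
    """THC fraction implied by a quoted dose and a quoted mass."""
    return thc_milligrams / (material_grams * 1000)


CIGARETTE_GRAMS = 0.5   # typical U.S. cigarette, per the text
US_THC_FRACTION = 0.01  # about 1% THC, per the text

casual_dose = thc_mg(1 * CIGARETTE_GRAMS, US_THC_FRACTION)   # one cigarette
daily_dose = thc_mg(3 * CIGARETTE_GRAMS, US_THC_FRACTION)    # three per day
heavy_dose = thc_mg(10 * CIGARETTE_GRAMS, US_THC_FRACTION)   # ten per day

# Potency implied by the Greek ad libitum estimate (derived, not quoted).
hashish_fraction = implied_fraction(150, 3)

print(round(casual_dose), round(daily_dose), round(heavy_dose))
print(round(hashish_fraction, 3))
```

The three U.S. figures come out at about 5, 15, and 50 mg, matching the text, so the 40-60 mg estimated for moderate daily users in Egypt, Morocco, and India is several times the U.S. daily average.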

American users are quite capable of consuming larger amounts. A recent 21-day 
ad libitum study found subjects who were previously casual users averaged 60 mg 
THC per day during the experiment, while subjects who used marijuana daily 
prior to the experiment averaged 100 mg (81). Similarly, a sample of U.S. military 
users in Germany was reported to average 3 grams of hashish per day (est. 150 mg 
THC) (103). However, it may be concluded that under normal conditions, the 
average amount of marijuana consumed by the U.S. user represents only a small 
percent of that for other cultures with well-established cannabis-using popula- 
tions. 

The low frequency of marijuana use and small amounts consumed fit the thesis 
advanced at the beginning of this review—namely, that the recent middle class 
drug epidemic is more the adoption of a popular style than an attraction to
the pharmacological properties of the drug. If this thesis is valid, it would be 
expected that a high proportion of current users would likely discontinue the 
practice after a few years. For instance, students primarily responding to peer 
influence might be expected to terminate usage after leaving school. Data on this 
issue are rather sparse at present. The 1972 Marihuana Commission survey found 
that, of those 18 and over who had ever used, one-half had now stopped (79). 
Longitudinal studies of individual users are now in progress, but to the reviewer’s 
knowledge there are no significant published results as yet. Studies of cannabis use 
in other cultures show initiation is also most common in adolescence and may be 
discontinued in adulthood. However, usage frequently persists for long periods; 
and persons using for 20-40 years are not uncommon. At least under conditions of 
cultural acceptance, cannabis usage appears to have a longevity comparable to 
that for alcohol and the opiates. 


Hallucinogens 


Unlike other psychoactive drugs, hallucinogens are rarely used on a daily basis. 
Tolerance to LSD occurs very rapidly, such that the effects are diminished on the 
second successive day of use and are largely absent after three daily administra- 
tions (19, 106). The same rapid tolerance exists for mescaline, psilocybin, and for 
some of the other hallucinogens. Thus, for practical purposes, these drugs cannot 
be used more than once or twice a week without losing much of their impact. 
Another reason for their relatively infrequent use is their lack of predictable 
effects. Habitual drug users generally seek to satisfy particular needs—escape, 
euphoria, anxiety relief, feelings of adequacy, etc. Since hallucinogens are quite 
inconsistent in terms of mood alteration, they do not meet this requirement. 
Survey data support the expectation of infrequent use. Only about 25% of college
students reporting having taken hallucinogens indicate usage within the past 
month (70). High school students tend toward more frequent use, with about 15% 
of those having used indicating one or more times per week (11, 16). The 1972 
Marihuana Commission found similar results; of those having taken
hallucinogens, 30% of youth and 15% of adults indicated use within the past month (1).

The amount of LSD taken per occasion is also relatively low. While the alleged 
potency of LSD dosage units on the illicit market is typically 200-250 mcg, 
quantitative evaluation of some 1000 samples in fiscal year 1971 found the median 
content to be 83 mcg (70). LSD dosages used for experimental and psychothera- 
peutic purposes are generally in the 50-200 mcg range. 

The hallucinogens also tend to be self-limiting in terms of duration of use. Most 
users have been found to either discontinue usage or use very infrequently after a 
period of 2 or 3 years (8, 71, 88). The reason for this pattern is related to the drug 
effects. In addition to not being a reliable mood modifier, the hallucinogen ex- 
perience tends to become less attractive with continued use. The major utility of 
these drugs is the uniqueness of the experience. As it is repeated many times, what 
was initially unique becomes more commonplace and there is a process of di- 
minishing returns. The effect of hallucinogens is indeed “a trip” and trips tend to 
lose their appeal when repeated too often. 


Stimulants and Depressants 


The pattern of nonmedical stimulant use will vary with the motivation of the user. 
Students studying for examinations, truck drivers, and other persons attempting to 
temporarily combat fatigue will take them on an as-needed basis. Occasional 
usage also seems to be the pattern for oral consumption of stimulants by young 
persons for euphoriant purposes. Similar usage occurs for barbiturates and other 
sedatives, often in combination with alcohol or marijuana. When used in this 
manner the typical dosages are around 20-30 mg for amphetamines and 300 mg 
for secobarbital or other short- or intermediate-acting barbiturates (67). 

An unknown number of persons use oral amphetamines nonmedically on a 
daily basis and acquire a high level of tolerance. Typical daily dosages for this 
group range from 200 to 700 mg per day (40, 53). 

In addition to the occasional use of sedatives by youths using many drugs, there 
is also substantial consumption among alcoholics, heroin addicts, methadone 
patients, and others. Usage may be intermittent or chronic. Tolerance to barbit- 
urates does not develop to the same extent as for amphetamines, and the ratio of 
lethal to therapeutic dose is also much smaller. The typical daily dosage reported 
for chronic nonmedical users is around 800-1500 mg (28), although daily dosages 
averaging 2400 mg have been observed among barbiturate-using heroin addicts 
(25). Daily dosages above the 500-800 mg level generally produce physical ad- 
diction (48). 

The pattern of intravenous methamphetamine use in the late 1960s was in 
“runs” lasting several days, with only a few days between runs. The drug was 
injected several times per day, and the typical daily dosages were of the order of 
800-2400 mg (58). As mentioned earlier, this type of high-dose intravenous use
appears to have sharply declined, although some observers indicate that there is 
now a more moderate pattern of intravenous use on a more episodic basis (24, 95). 

Relatively little is known about the current patterns of cocaine use other than 
from anecdotal reports. It is quite expensive, is considered to be a high-status drug, 
and is used mostly by the middle class marijuana-psychedelic group (107). The 
usual mode of administration is sniffing rather than the intravenous route 
employed by heroin addicts during an earlier period. Partly because of the high 
cost, most current usage is thought to be intermittent and in small amounts. 


ETIOLOGY AND METHODS OF SPREAD 


The limited scope of this review does not permit a review of the demographic, 
behavioral, and personality variables related to opiate addiction. Books by Chien 
and associates (18) and Ball & Chambers (7) provide good descriptions for the 
pre-1960 era. Robins (86) describes the military Vietnam opiate user as follows: 
“The man most likely to be detected positive in Vietnam was a young, single, 
black, low-ranking member of the Regular Army who had little education, came 
from a broken home, had an arrest history before Service, and had used drugs 
before Service.” 

The spread of heroin use has recently been investigated in terms of the contagious 
disease model (45, 46, 60), an approach first utilized by de Alarcon in 
England (26). The method shows considerable promise for identifying users in the 
early stages of an epidemic, especially when it occurs in a fairly well defined 
community. This, in turn, permits the early application of intervention ap- 
proaches. 

The characteristics of the current young middle class drug user have been 
described in dozens of studies—mostly for college populations. The early studies 
typically compared users and nonusers of marijuana. As the latter category re- 
ceded to a minority status in some schools, populations were generally categorized 
in terms of frequent, occasional, and nonuse. Users are more likely than nonusers 
to come from urban professional families with above average education and 
income (12, 21, 38). Especially during the early phases of the epidemic, use was 
much higher for white as opposed to minority groups (12, 84). Males outnumber 
females, especially in the more frequent-use categories (68). Users are somewhat 
more likely to come from broken homes (12, 92), or to describe their parents’ 
relationship as poor (6), and to have liberal parents (44). They are more likely to 
have fathers who use alcohol and tobacco and mothers who use tranquilizers (44, 
92). However, Kandel (54) recently employed data obtained from triads (parents, 
adolescent, and best school friend) to establish that, while parents’ drug usage was 
positively related to adolescent use, peer group influence was much greater. 

Haagen (41) has conducted one of the more interesting studies of the social 
and psychological correlates of marijuana use. He collected extensive biographical 
and test data at the time of college admission (1965), when virtually none of the 
sample had used marijuana, and again following the third year of college when 
59% reported some use. At the time of admission, those who subsequently became 
occasional or frequent marijuana users scored slightly higher than the nonuser 
group on aptitude and achievement tests. On the other hand, the group that later 
used frequently had poorer study habits in high school, worked less hard, were 
more dissatisfied with school, and made lower grades than did the nonuser group. 
The frequent-using group were much more undecided as to their intended major 
at the time of admission, and continued to be uncertain about plans subsequent to 
college. They were more interested in social life, more accepting of nonconformist 
philosophy, and moved more in this direction during college. Another longitu- 
dinal study by Johnson (51a) followed 2200 males from the 10th grade in 1966 to 
one year past high school (1970). The findings are generally the same as those 
found by Haagen, but more limited in scope. 

Smith (93, 94) has also conducted a prospective study of high-school students, 
finding self and peer ratings of rebelliousness to be among the best predictors of 
those who subsequently became users of marijuana and other drugs. Jessor et al 
(50) have also reported some success in predicting, on the basis of various per- 
sonality, belief, and attitude scores, which high school nonusers would subse- 
quently initiate marijuana use. In addition, they found that those beginning use 
showed greater changes on these variables than did the continuing nonusers. None 
of these results occurred in the college sample. 

As would be expected, users are more likely to consume alcohol and tobacco. In 
general, they exhibit a more unstable life style with respect to residence, work, 
school, and goals (44). They receive more traffic violations (44, 51), are more likely 
to have sought psychiatric counseling (51), are less religious, belong to fewer 
organizations, and participate less in athletics (41, 44). Users have sexual relations 
at an earlier age, more frequently, and with more partners (44, 51). They exhibit 
more liberal and leftist political views, see themselves as outside the larger society, 
have less respect for authority, and are more likely to be activists (12, 38, 44). 


EFFECTS OF MARIJUANA AND HALLUCINOGEN USE 


Again, space does not permit more than a cursory examination of the literature in 
this area. At present the only clearly established physical complication resulting 
from heavy marijuana or hashish use is bronchitis (103). Two widely publicized 
preliminary studies have suggested possible serious physical damage from heavy 
marijuana use. One found evidence of cerebral atrophy among a sample of ten 
hospitalized patients with a history of marijuana and other drug use (15). The 
second reported a significantly lower level of cellular mediated immunity among 
marijuana users as compared to nonusers (77). Neither of these findings has been 
replicated. One recent study reported a depression of testosterone levels among a 
sample using marijuana four or more times per week (mean of 9 cigarettes per 
week) (56a). Soueif found a sample of 850 imprisoned for hashish use in Egypt 
tended to score lower than nonusers on various psychomotor and simple cognitive 
tests; however, adequate controls for age and education were not maintained (97). 
Several authors have attributed various adverse behavioral and personality 
problems to the frequent use of marijuana, especially that by adolescents (55-57). 
Others have criticized these conclusions on methodological grounds, showing that 
the same results can be predicted equally well by other associated forms of 
deviance such as premarital sex and high school truancy (4, 51). Regardless of the 
extent to which the relationship is causal or associational, there is clear evidence 
that heavy marijuana and hallucinogen users do consistently exhibit high rates of 
unemployment and general underachievement (8, 10, 88). 

Two recent experimental studies have explored the effect of marijuana on 
productivity. In one study subjects were engaged in a microeconomy system in 
which they were paid on a piecework basis for making wooden stools or weaving 
woolen belts over a period of 3 to 4 months (23). The results showed dose- 
dependent decrements in productivity, and discontinuation of marijuana was 
followed by an increase in productivity. The second study collected detailed data 
on the work output of Jamaican laborers before and after smoking marijuana (89). 
The results showed marijuana intake caused decreased productivity per unit of 
time, an increase in calories consumed per unit of work, and less organization in 
work movements, greater repetition, and greater variability as measured in a 
videotape analysis. 

With regard to hallucinogen effects, the long-term psychological effects of 
LSD have been reviewed recently by McWilliams & Tuttle (74). While 
the long-standing controversy over LSD and chromosomal alterations is not 
completely resolved, recent reviews suggest that it does not cause lasting chromo- 
some breaks in vivo (29, 61). Four studies have employed at least a portion of the 
Halstead-Reitan test battery in an attempt to measure any LSD-related organicity. 
Three of these (3, 22, 72) found that the LSD group scored worse than the 
comparison group on some of the tests, but the results were not consistent across 
the three studies. The other study found no effect (108). None was able to control 
for possible predrug differences in the LSD and comparison groups. Furthermore, 
those test differences found were generally not correlated with the number of 
reported LSD ingestions. 


PREVENTION 


Public concern over the recent drug epidemic has stimulated a large increase in 
prevention efforts in the past few years. Those directed at limiting the supply have 
been relatively successful with regard to heroin (13, 31). However, the overall 
impact of drug education efforts is regarded as highly uncertain.⁴ This sentiment 
is reflected by a 1973 moratorium on the Federal production and support of new 
educational and mass media materials related to drug abuse (99). Also, in 1973 the 
National Marihuana Commission (79) concluded that no drug education program 
in the United States or elsewhere had been proved sufficiently successful to 
warrant its recommendation, and therefore suggested a moratorium on all drug 


4 Good reviews of drug education and other prevention efforts are contained in Wald & 
Abrams (105) and Abrams, Garfield & Swisher (2). 

education programs in the schools and the repeal of legislative requirements for 
such courses in the school curriculum. These rather drastic recommendations 
resulted from the inability to demonstrate that drug education programs were 
effective in reducing drug-using behavior, and in some instances the programs 
may well have been counterproductive (101, 102). Mass media drug prevention 
efforts were similarly unevaluated (78). 

Part of the drug education problem stemmed from the uniqueness of the middle 
class drug epidemic discussed at the beginning of this review. Previously, drug 
education stressed the health hazards of alcohol and tobacco and, while the 
warnings might be ignored, there was no active opposition to the material being 
presented. With the advent of the new drug epidemic, teachers were at a disad- 
vantage with respect to the students because of the lack of first-hand experience 
with the drugs being discussed. They were also faced with the problem of how to 
deal with drug use closely associated with an antiestablishment movement. 
Whereas student use of alcohol and tobacco had long been associated with rebel- 
lion against authority, there was previously no associated prodrug philosophy 
which claimed beneficial effects from the drug use per se. Finally, the teacher was 
forced to advocate complete abstinence with regard to illicit drugs, a stance that 
was likely to be rejected outright by the casual marijuana user. 

In the face of these difficulties, there has been a tendency to de-emphasize the 
drug issue by integrating the educational material with other relevant courses, and 
to utilize “affective training” techniques aimed at developing overall responsible 
behavior. Problem solutions, peer group pressure, and value clarification are 
explored by methods ranging from casual rap sessions to intensive encounter 
groups. Without questioning the potential value of these approaches, many do see 
a danger of overgeneralizing, i.e. dismissing cognitive drug education on the basis 
of experience obtained in the recent highly atypical drug epidemic. Drug use is not 
normally encountered as an integral part of an ideological movement replete with 
advocates, a supportive underground press, and an accompanying art and music 
form. 

Most of the secondary prevention techniques which have been employed have 
not come under the criticism leveled at drug education efforts. These include 
school counseling, “hot lines,” crisis centers, free clinics, and street drug analysis 
services. Such intervention approaches usually aim at preventing the young drug 
user from drifting into more destructive drug using or other behavior. They are 
typically more oriented toward encouraging moderation and good judgment than 
total abstinence, and serve a useful function in helping prevent further social 
isolation and alienation of a group that is not especially deviant in other respects. 


TREATMENT 


Treatment emphasis has been largely limited to narcotics addiction, and metha- 
done maintenance is currently the major modality. DuPont (30) estimated there 
were around 80,000 methadone maintenance patients in 1974, and that the 
number had been stable for the preceding year. With the exception of a few 
localities, there are currently no waiting lists for methadone or other forms of 
treatment. The other well-defined volunteer modality is the therapeutic com- 
munity, with a probable population of 10-15 thousand. Civil commitment ac- 
counts for an additional 18,000 (not including those also on methadone 
maintenance) in the Federal, California, and New York programs (73). An un- 
known additional number receive treatment at various detoxification and other 
centers—primarily on an outpatient basis. DuPont (30) estimated a total treatment 
population of 130,000 in 1974. Since treatment capacity is no longer a limiting 
factor, Federal efforts are being made to increase the treatment population 
through an informal system of diversion from the criminal justice system 
(TASC, Treatment Alternatives to Street Crime) (99). 

It is probably fair to conclude that methadone maintenance has passed the 
honeymoon stage. The criticism of substituting one narcotic for another has 
continued, and as yet there is little evidence that a significant portion of metha- 
done patients are eventually able to withdraw and remain abstinent. As this form 
of treatment has expanded and incorporated a less select population of addicts, the 
early one-year retention rates of 85%, found by Dole (36), have tended to drop 
closer to 40 to 50 percent (65). Some authors have also challenged the highly 
optimistic early reports of improved socialization in terms of employment and 
reduced crime (62). Nevertheless, methadone maintenance has been able to at- 
tract and retain a much higher number of narcotics addicts than any other form of 
volunteer treatment. There is little doubt that it has substantially reduced both the 
individual and social costs of addiction. Therapeutic communities (27) achieve 
excellent results for some addicts at relatively low cost; however, only a fraction of 
the addict population is attracted to this mode of treatment, and only about 10% of 
these remain (33, 63). Civil commitment, as it is now employed, is generally 
considered too coercive for the individual benefits achieved (59, 64); although for 
perhaps two-thirds of the addict population, some level of compulsion is probably 
necessary to limit the associated criminal costs of addiction (73). 

Experimental heroin maintenance programs have been proposed in both the 
United States and Canada as a means of attracting larger numbers of addicts into 
treatment (24, 104). This approach has been relatively successful in England,⁵ but 
the differences in the history, size, and nature of the problem make it unlikely that 
it will be adopted in the United States in the near future. 

A great deal of research has been directed toward developing a satisfactory 
opiate antagonist in the past few years. Cyclazocine has been in experimental use 
for several years, and oral doses effectively block opiate effects for 24 hours. 
However, it frequently produces undesirable side effects. Naloxone is an effective 
antagonist with little or no agonist activity; however, it is short acting, requires 
large doses, and is expensive. The most promising antagonist currently in the 
clinical testing phase is naltrexone. It is effective for 24 hours in moderate oral 
doses and has little agonist activity (66). While this represents a considerable 


5 In actuality, most maintenance patients in England now receive methadone rather than 
heroin, although it is typically injected rather than taken orally (52). 

advance in the search for effective narcotic antagonists, it does not solve the 
problem of treatment motivation on the part of the patient. Many observers 
believe that a means of producing a narcotic blockade for several days or weeks is 
necessary before this approach will have wide applicability. Research is also 
continuing on the long-acting methadone congeners (49). 

Treatment and preventive directions which are to be emphasized in current 
Federal plans include: expanded treatment for polydrug abuse; further attempts 
to provide job opportunities for the ex-drug addict; efforts to enlarge the treatment 
population through cooperation between therapeutic agencies and the criminal 
justice systems; school-based early intervention programs; and increased law 
enforcement efforts against illicit drug traffic (100). 


The psychology of aging appears to have matured markedly during the 5 years 
recorded in this chapter. While the careful mapping of differences in performance 
and behavior characteristics of young and old organisms is continuing, there has 
been a significant shift of interest towards the direct explication of age changes by 
means of descriptive as well as experimental strategies. Psychologists interested in 
aging have established more contact with those engaged in the study of earlier 
developmental stages than has been the case previously, and in turn, have made 
significant methodological contributions to developmental psychology in general. 
Indeed, perhaps the most noteworthy change has been an increasing interest in 
life-span developmental psychology on the part of gerontologists who are now 
willing to give more than lip service to the fact that organisms are not born old [but 
see Baltes (16)]. As evidence of this change, note, for example, the change in name 
of Division 20 from a “Division on Maturity and Old Age” to a “Division of Adult 
Development and Aging.” Equally important has been the recognition that the 
implicit decrement model of aging is unduly pessimistic and not necessarily 
isomorphic with empirical fact. The introduction of more sophisticated 
methodologies has highlighted the fact that much of our knowledge of adult 
development is based on the comparison of different cohorts, and consequently, for all we 
know, in the past we might have collected data for a psychology of generational 
differences rather than for a psychology of aging. These issues have received 


1 See NAPS document No. 02431 for 68 pages of supplementary material. Order from 
ASIS/NAPS c/o Microfiche Publications, 305 E. 46th Street, New York, NY 10017. Remit in 
advance for each NAPS accession number. Make checks payable to Microfiche Publications. 
Microfiche are $1.50, and photocopies are $10.70. Outside of the U.S. or Canada, postage is 
$.50 for a fiche or $2.00 for a photocopy. 
thousand papers initially reviewed for this chapter. 




66 SCHAIE & GRIBBIN 


considerable attention, as have the consideration of change or stability in social 
roles and the determination of psychophysiological correlates of age changes in 
behavior. The viability of the field is further demonstrated by APA’s significant 
participation in the 1971 White House Conference on Aging, which included a 
major task force report (96) and the appearance of a number of books on aging 
and life-span developmental psychology at a markedly incremental level of 
sophistication. 

Because of the huge number of publications which have appeared in this area 
since Botwinick’s chapter on Geropsychology in the 1970 Annual Review of Psy- 
chology (45a), some exclusions had to be made. First, we decided to review 
primarily literature on human development and to neglect work with animals. 
This decision was quite arbitrary and made simply because of the impossibility of 
covering everything. The second exclusion was quite purposive. That is, since the 
senior author has consistently argued that the pathology of old age is corollary but 
not causal to aging (287, 288), it was felt that the literature on psychopathology and 
physiological deficit in the aged would more properly be reviewed in chapters 
devoted to the substantive issues of pathology and deficit. 

The plan of this chapter is to discuss first some of the major methodological 
issues; next to review briefly books primarily concerned with adult development; 
and then to proceed with a review of the most noteworthy substantive contribu- 
tions in the research literature. 

As in previous reviews, no attempt has been made at comprehensiveness. Our 
effort has been directed towards indicating the more significant research avenues, 
methodological innovations, and attempts at systematization. Covered in this 
chapter is the literature from the latter part of 1968 through the end of 1973, with 
some inclusion of papers appearing in early 1974. 


METHODOLOGICAL ISSUES 


We will first consider the question of how one should look at adult development; 
next, the implications ensuing from the debate concerning the relationship of data 
obtained from studies employing the cross-sectional or longitudinal method and 
the proposals for more sophisticated mixed strategies will be reviewed. Attention 
will also be given to multivariate methods particularly relevant to the study of 
adult development, the debate about the assessment of functional age, the issues 
concerning the prediction of life-span, and the beginning interest in manipulation 
of the age variable. 


How Should One Look at Adult Development? 


The developmentalist interested in childhood and adolescence has no problem in 
making credible a basic developmental model whose best fit would be a Gompertz 
type function involving decelerating rate of growth.³ No such simple solution is 


3 This attitude has been challenged recently by the proposal to substitute behavior 
modification paradigms for organismic models of development (cf 223). 

readily available for the study of adult development. Most investigations in the 
past have implicitly accepted the validity of an irreversible decrement model and 
consequently have assumed that age differences in a given behavior between 
young and old (which most often yields differences in favor of the young) can 
indeed be interpreted as evidence of ontogenetic age decrement. However, as early 
as 1940 and again more recently, Kuhlen (182) has pointed out to us that much of 
what appears as age differences to the casual observer might indeed more properly 
be identified as the effect of sociocultural change. More recently, and quite inde- 
pendently, this issue has been formalized in the sociological literature by Ryder 
(276) and in the psychological literature by Schaie (281), by considering the 
relationship between cross-sectional and longitudinal data. 

Such analyses reveal the plausibility of alternative models. Indeed, for the 
period of adulthood, a stability model seems plausible for many variables of 
interest to the psychologist [notably the crystallized mental abilities (145) and 
many personality variables (228)]. Another alternative is a decrement-with- 
compensation model, which would permit both ontogenetic decrement as well as 
compensation by successively more favorable environmental life support systems. 
It is assumed here that many of the difficulties of the aged could be remedied by 
alternative modifications of the young and middle aged vis-a-vis the old (287, 315). 
A more radical alternative has been proposed by Wohlwill (346), who suggests 
that age in developmental studies ought to be treated as a dependent rather than 
an independent variable. 

Within the alternative models for adult age functions, there has been an in- 
creasing interest in trying to determine the relative contribution of heritability and 
environment in the maintenance of behavior in old age, an issue which clearly 
relates to public policy regarding retirement and rehabilitation and may well 
become as controversial as that over the role of compensatory education in 
childhood. Most pertinent methodological contributions to this issue have been 
provided by Cattell (69, 70), Buss (58), and by Schaie, Baltes & Nesselroade (289). 
Relevant discussions of the relation of possible age-related defects in chromosome 
structure are given by Jarvik & Kato (161). 

Studies concerned with the total life-span are faced with the problem that 
prospective investigations will frequently extend beyond the life expectancy of the 
investigator. Greater emphasis has therefore been placed upon retrospective ac- 
counts, and particularly the use of life-history analyses. Problems here concern the 
reliability of such information, the intrusion of contemporary stereotypes, and 
generalized differences in perceived social desirability (62, 157, 196, 300, 326). 
Back & Bourque (13) introduced an interesting life-graph approach which at one 
point in time collects subjective judgments for prospective and retrospective data 
on members of different cohorts over their anticipated life-span. 

Considerable attention has been given to the issue of sampling. Although 
continued demand is made for more representative samples, it has become evident 
that in a longitudinal study a sample which is representative of a given population 
at the study’s inception will become successively less representative due to the 
effects of nonrandom experimental mortality (sample attrition). Even if the sample is 
maintained in its entirety throughout the life of the study, it will cease to be 
representative because of changes in the parent population. Studies of specific 
processes, therefore, might best be conducted with carefully selected non- 
representative samples which define the limits of the behavior of interest (see 
288, 295 for more detailed discussions of this issue). The effects of selective 
dropout of subjects have been documented in a number of studies which invari- 
ably show that residual samples score higher on tests of ability and are rated 
more in the socially desirable direction than is true for the initial sample (9, 24, 
43, 260, 291). 
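
The direction of the attrition bias described above can be illustrated with a small simulation. The sketch below is not drawn from any of the studies cited: the score distribution, the dropout rule, and the function name simulate_attrition are all assumptions chosen for illustration. It shows only that when the probability of remaining in a study rises with the initial score, the residual sample will average higher than the sample first tested.

```python
import random
import statistics


def simulate_attrition(n=2000, seed=0):
    """Simulate one retest with score-dependent dropout (hypothetical rule)."""
    rng = random.Random(seed)
    # Assumed ability scores at the first testing: mean 100, SD 15.
    initial = [rng.gauss(100, 15) for _ in range(n)]
    residual = []
    for score in initial:
        # Assumed retention probability: increases with the initial score
        # and is clamped to the interval [0.2, 0.9].
        p_stay = min(0.9, max(0.2, 0.55 + (score - 100) / 100))
        if rng.random() < p_stay:
            residual.append(score)
    return initial, residual


initial, residual = simulate_attrition()
print(f"initial sample:  n={len(initial)}, mean={statistics.mean(initial):.1f}")
print(f"residual sample: n={len(residual)}, mean={statistics.mean(residual):.1f}")
```

The size of the difference depends entirely on the assumed dropout rule; the point of the sketch is the direction of the bias, not its magnitude.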


Age Changes Vs Age Differences Vs Sociocultural Change 


Much recent controversy in the psychology of aging regarding the interpretability 
of data has centered around the conflicting data obtained from cross-sectional and 
longitudinal studies, particularly upon the topic of age decrement in intelligence 
(cf 23). 

Schaie was able to show that both cross-sectional and longitudinal methods are 
special cases deducible from a general developmental model (281, 282). A third 
possible approach is the time-lag method, which compares samples of the same 
age at different points in time. These three methods represent confoundings, two 
at a time, of the three components of age, time of measurement, and cohort (time 
of birth). Results from any two methods of descriptive analysis can therefore agree 
only under rather unusual circumstances. Since neither cross-sectional nor longi- 
tudinal methods can yield unambiguous measures of age differences (cross- 
sectional) or age changes (longitudinal), it was recommended that new mixed 
strategies involving the replication of either cross-sectional or longitudinal studies 
be considered to permit the segregation of variance due to chronological age from 
that attributable to cohort differences in cross-sectional studies and from non-age 
related environmental effects of longitudinal studies [see (286) for an heuristic 
example]. As a consequence, it was proposed that many age differences reported in 
the literature should more parsimoniously be interpreted as generational 
differences (284) and that results from single-cohort longitudinal studies were probably 
uninterpretable for most psychological variables (285). It was further suggested 
that studies hypothesizing the age decrement model required data analyzed via 
the cohort-sequential method (two or more cohorts tested at two or more common 
ages), that studies assessing the stability model required the cross-sequential meth- 
od (two or more cohorts tested at two or more common times of measurement), 
and that the assumption of a decrement-with-compensation model requires 
analysis via the time-sequential method (two or more ages examined at two or 
more common times of measurement) (288). 
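
The confounding described above follows from the fact that age, cohort, and time of measurement are linked by the identity age = time of measurement − cohort, so that fixing any two components fixes the third. The following sketch makes the point concrete. The function names and the birth and testing years are hypothetical and serve only as illustration; they are not taken from the cited papers.

```python
from itertools import product

# An observation is a (cohort, time, age) triple with age = time - cohort.


def cross_sequential(cohorts, times):
    """Two or more cohorts tested at two or more common times of measurement."""
    return [(c, t, t - c) for c, t in product(cohorts, times)]


def cohort_sequential(cohorts, ages):
    """Two or more cohorts tested at two or more common ages."""
    return [(c, c + a, a) for c, a in product(cohorts, ages)]


def time_sequential(ages, times):
    """Two or more ages examined at two or more common times of measurement."""
    return [(t - a, t, a) for a, t in product(ages, times)]


def varying(observations):
    """Return the names of the components that vary within a design."""
    names = ("cohort", "time", "age")
    return [
        name
        for i, name in enumerate(names)
        if len({obs[i] for obs in observations}) > 1
    ]


# The three simple designs are degenerate cases with one component held fixed.
cross_sectional = cross_sequential([1900, 1920, 1940], [1970])  # one time
longitudinal = cross_sequential([1920], [1950, 1960, 1970])     # one cohort
time_lag = cohort_sequential([1900, 1910, 1920], [50])          # one age

for label, design in [
    ("cross-sectional", cross_sectional),
    ("longitudinal", longitudinal),
    ("time-lag", time_lag),
]:
    print(label, "confounds", " and ".join(varying(design)))
```

In each simple design exactly two components vary together, which is why a difference observed with one method cannot be assigned to a single source. The sequential functions cross two components fully, which is what permits the partial separation of variance discussed in the text.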

Baltes (15) examined the general developmental model, and in addition to 
suggesting caution in its interpretation, concluded that since two of the three 
components of developmental change are of necessity confounded, it would 
suffice to utilize only two sequential strategies termed cross-sectional and 
longitudinal sequences (cf 19). This argument has been examined by Buss (58), who 
agrees with Baltes, and by Wohlwill (347), who agrees with Schaie.⁴ 

Discussion of the general developmental model and the empirical studies em- 
ploying sequential strategies which have been stimulated thereby suggests that 
researchers on adult development are concerned with at least three distinct issues: 
age changes, age differences, and sociocultural change. 

The researcher interested in age changes wishes to find lawful relationships that 
describe ontogenetic change across the life course of a species; these relationships 
either generalize across cohorts or yield functions that vary for successive cohorts 
in a predictable fashion. In this instance, the conventional longitudinal method 
will provide insights only for the specific cohort under study unless we subscribe to 
the irreversible-decrement model. Otherwise, either the cross-sequential or the 
cohort-sequential method would be required before generalization is possible. 

The investigator concerned with age differences wishes to determine in what 
manner individuals of different ages differ in behavior at one point in time. If it 
does not matter whether such difference is due to age or generational differences, 
then the cross-sectional method is perfectly applicable. However, if we wish to 
consider either the stability or the decrement with compensation model in ex- 
plaining age differences, it then becomes necessary to employ the cross-sequential 
or time-sequential methods. 

But many investigators in the psychology of aging are not really interested in 
aging per se. Rather, they wish to determine whether there are generational 
differences which explain differential behaviors of the young and old at some 
point in time. These investigators are really concerned with the effect of sociocul- 
tural change, which in the absence of age changes can best be studied by the 
cross-sectional method, or in their presence by the time-sequential method. 

Studies of adult development which have applied the new sequential strategies 
in the area of intelligence include (199, 224, 290, 292, 296, 297); in the area of 
visual illusions (101); and in the social-personality domain (259, 294, 350). But 
several major ongoing studies (e.g. the Boston Normative Aging Studies, Duke 
Longitudinal Studies, NIH Intramural Longitudinal Study) have been redesigned 
to permit sequential analyses, and the next few years will see the publication of 
findings based on firmer ground than the data thus far at hand. 

Cohort analysis has also received much stimulation by recent advances in the 
sociological literature (261), and has begun to affect the social-psychological 
literature in conjunction with the above-described considerations in develop- 
mental psychology. Methodologically sophisticated treatments of intergeneration 
differences are presented by Bengtson & Black (30), Bengtson & Kuypers (31), 
Klecka (175), Edwards (94), and Palmore & Whittington (235). Methodologically 
less satisfactory attempts (because they essentially disregard the confounding of 
age and generations) dealing with the psychological aspects of intergenerational 
problems are reported by Garsee (120), Kalish & Johnson (166), and Wake & 
Sporakowski (335), and equally limited analyses of the “generation gap” are 
provided by Cameron (65) and by Fengler & Wood (110). 

⁴ A further analysis of the general developmental model and the “Baltes-Schaie 
controversy” is provided by Botwinick (48). Baltes and Schaie have recently concluded that 
their apparent controversy represents an incomplete explication of the model. They agree 
that the several data matrices suggested in the original paper (281) are useful, but that a 
number of different extensions of the model may make it more powerful (289). 


Statistical Procedures for the Analysis of Aging Data 


A number of recent advances in the application of multivariate procedures to data 
involving change over age and/or time are of interest to gerontologists. Cattell (68) 
has extended his data box to provide for analyses across ages and cultures. 
Detailed treatments of longitudinal factor analyses are provided by Bentler (33) 
and by Corballis & Traub (78). Criteria for matching factors over occasions are 
examined by Nesselroade & Baltes (222). The latter authors (20) also provide a 
detailed account of procedures and problems in the developmental analysis of 
individual differences on multiple measures, and Buss (59) has investigated an 
extension of Cattell’s data box to age changes and differences. The problem of 
regression to the mean has been investigated conceptually by Furby (118) and 
empirically by Nesselroade, Schaie & Baltes (224). An extension of latent structure 
analysis to developmental data is advocated by Murray, Wiley & Wolfe (216). 
Finally, much useful material may be found in Wohlwill’s chapter on correlational 
methods in the study of developmental change (347, pp. 248-88). 


Functional Age 


Dissatisfaction with chronological age as the independent variable in geronto- 
logical studies has led to an examination of the concept of functional age. First 
discussed by Birren (38), this concept suggests that individuals age at different 
rates along different dimensions, and that one might estimate an individual’s 
functional age very much along the lines of the old mental age concept. Functional 
age would be a convenient method to apply to problems of variable retirement. 
However, studies of the concept in an industrial context reveal that few individuals 
are significantly ahead or behind their calendar age on a variety of measures (89, 
90). The functional age strategy has been most recently revived by Nuttall (230). 
Data based upon application of the functional age concept to psychological 
variables have recently been presented for the measurement of social age (267), 
abilities and personality (116), and for interpersonal and self-concept variables 
(168). It should be noted that the functional age concept requires assumption of 
the decrement model of aging and consequently may be inappropriate for many 
psychological variables over the major portion of the adult life-span. 


Prediction of Death 


More compatible with an adult stability model are the discussions of terminal drop 
and the prediction of death from changes in cognitive function. Here it is argued 
that changes in performance and personal adjustment within individuals occur as 
precursors of death. Because of the complexity of psychological processes, the first 
indications of systemic breakdown might therefore be detected long before severe 
physiological decrement leading to a death trajectory can be reliably detected. 
This argument has been most clearly advanced by Riegel & Riegel (258). Data 
from retrospective analyses suggesting that such terminal drop might indeed occur 
anywhere between 1 and 5 years prior to death are presented by Lieberman & 
Caplan (195), Riegel (256), Blum & Jarvik (42) and by Reimanis & Green (255). 
Other discussions of the differential characteristics of survivors in longitudinal 
studies and the psychological concomitants of longevity may be found in Eisdorfer 
& Wilkie (98), Powers & Bultena (251), Hall et al (132), Bartko et al (26), Jarvik & 
Blum (158), Pfeiffer (242), Rose (266) and Savage (280). See also the methodo- 
logically oriented volume by Rose & Bell (268). 


Experimental Manipulation of the Aging Variable 


Although much gerontological research requires quasi-experimental designs, 
there is an increasing attempt to bring aging phenomena under laboratory control 
by means of behavior modification strategies (149, 185, 263). These studies suggest 
that there may be reversibility for many phenomena considered in the past to be 
irreversible and overdetermined developmental trends, and thus would seem to 
reopen many issues regarding possible roles of the old in society (also cf 287). A 
related trend is the cautious investigation of age simulation studies (17, 20). 
Finally, there have been a number of reviews of strategies to be applied to the 
experimental study of aging variables (22, 39, 126, 142). 


BOOKS 


Numerous books covering a wide variety of material have recently been pub- 
lished. These may be divided into those dealing with general life-span issues (72, 
81, 127); specific topic approaches covering the life-span (22, 153, 202, 223); those 
dealing with adulthood and aging in general (12, 25, 40, 48, 60, 61, 74, 96, 171, 181, 
233); and those on specific topics of aging (29, 54, 138, 160, 234, 268, 283, 341). Of 
specific interest to those concerned with methodological issues are books by 
Goulet & Baltes (127) and Nesselroade & Reese (223). The series on issues in 
developmental psychology (22, 127, 223) promises to be most useful for those 
interested in life-span approaches, while the product of the APA Task Force on 
Aging (96), which represents an effort to include all the major fields of the 
psychology of aging, should be of interest to those desiring broad general coverage 
on this area. 


COGNITIVE PROCESSES 


The past 5 years hopefully have seen the last accumulation of cross-sectional 
studies of intellectual function which by use of inadequate samples and the 
inappropriate comparison of young and old samples with very different charac- 
teristics report data on decrement which, as indicated earlier in this chapter, ought 
at best to be considered evidence for generational age differences. Detailed 
discussions of these issues may be found in Baltes & Labouvie (18) and Botwinick 
(48). For a brief nontechnical summary, see Baltes & Schaie (23). 

From the evidence now accumulating, it appears that the earlier notion of an 
irreversible decrement model for intelligence can no longer be upheld. Indeed, 
there is reason to believe that given a suitably favorable environment there may be 
gain in certain intellectual functions into very old age. Apparent decrement in the 
longitudinal studies appears to relate to substantial population differences in 
environmental aspects correlated with intelligence, which have either become 
more favorable for younger cohorts, or which are amenable to compensatory 
intervention. 

In this section some of the recent findings on confoundings which have led to 
earlier misleading interpretations of the status of intellectual development in 
adulthood will be considered. For the record, we will then mention a few of the 
more interesting follow-up studies, where we must note that their results may be 
cohort specific. In addition, we will consider some novel studies on age differences 
in intelligence, noting that they most likely have little or nothing to say with regard 
to aging within individuals. 


Differential Pattern of Adult Development 


Studies of adult development with global IQ estimators such as the Army Alpha 
and the Wechsler scales (cf 209) conceal patterns of change because they provide 
averages of functions which may have different slopes or are of questionable 
factorial complexity. Formal models of differential development have therefore 
been conceptualized primarily in the context of the fluid versus crystallized intel- 
ligence model (144-146). It appears that the life course of the acculturated crys- 
tallized intelligence factors does not show any decrement whatsoever, while fluid 
dimensions which depend upon species-specific and probably biologically me- 
diated functions tend to decline as a corollary of decrement in speed of capacity of 
response (83, 145, 284). Moreover, evidence is beginning to emerge which suggests 
that the pattern of adult intelligence may emerge prior to early adolescence (113). 

Far less certain are differential findings for the verbal and performance parts of 
the Wechsler, since we do not know whether the discrepancy in older subjects is 
due to differential decline or differential initial performance levels in successive 
generations. 


Ontogenetic Versus Generational Change 


A number of studies with the Primary Mental Abilities have now conclusively 
demonstrated that actual decrement for Verbal Meaning, Space, Reasoning, and 
Number does not occur until the late sixties, and that the point of decline is 
shifting upwards with increasing asymptotic level for successive generations. This 
information is provided in short-term longitudinal studies over 7 and 14 year 
periods within cohorts (290, 296). It has also been verified in studies with inde- 
pendent random samples which control for the effect of repeated testing and loss 
of representativeness in a longitudinal study (292, 297). These results are main- 
tained also when appropriate controls are introduced for the effects of subject 
attrition (24, 291). Factor analysis and cross-occasion factor matching designs for 
separating ontogenetic and generational change have also been used to show that 
the life course for the second order factors of crystallized intelligence, cognitive 
flexibility, and visualization is stable or incremental throughout adulthood, while 
the life course for visuo-motor flexibility suggests age decrement in early adult- 
hood, decelerating in rate in old age (224). Similar findings occur when differen- 
tial life courses are charted for individuals differentiated by initial level or 
performance, with past reports of differential change being attributed to re- 
gression effects (21). 
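
How regression effects can masquerade as differential change is easily illustrated. The sketch below, in Python, assumes a hypothetical measurement model (a stable true score plus independent error at each of two occasions, with arbitrary means and variances); it is not a reanalysis of any data cited here. No simulated subject changes, yet subjects selected for high initial scores appear to decline and those selected for low initial scores appear to gain.

```python
import random

rng = random.Random(0)            # fixed seed so the run is repeatable
N = 10_000

# Hypothetical model: a stable true score plus independent error
# at each of two occasions. No one's true score changes.
true = [rng.gauss(100, 10) for _ in range(N)]
t1 = [t + rng.gauss(0, 10) for t in true]
t2 = [t + rng.gauss(0, 10) for t in true]

# Split subjects by their observed level at the first occasion.
order = sorted(range(N), key=lambda i: t1[i])
low, high = order[: N // 4], order[-(N // 4):]

def mean_change(idx):
    """Average second-occasion minus first-occasion score for a subgroup."""
    return sum(t2[i] - t1[i] for i in idx) / len(idx)

low_change = mean_change(low)     # positive: apparent "gain"
high_change = mean_change(high)   # negative: apparent "decline"
print(round(low_change, 1), round(high_change, 1))
```

The apparent changes arise solely from selecting on an error-laden first score, which is why differential change by initial level cannot be interpreted without a control for regression.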

It should be noted that the story is far less clear for Word Fluency for which 
previous studies have noted significant within-generation decrement (284). Most 
recent findings here suggest that there are significant secular trends which indicate 
systematic decrements or increments not depending upon the subject’s age, but 
rather upon the time period which is being monitored (290, 292). 


Comparability of Factor Structure 


A most frustrating problem remaining to be solved is the question of stability in 
factor structure. Only a beginning has been made to consider the possibility that 
the organization of abilities in old age might differ from that of the young (135, 
224, 247). Whether or not this will lead to the construction of alternate test forms 
for individuals of advanced age remains to be seen. 


Factors Influencing Intellectual Performance 


In addition to the discovery of generational differences, there has been a concerted 
effort to identify parameters which account for the fact that under certain condi- 
tions the elderly can show better performance than is generally expected. The 
effect of cautiousness and other motivational ability-extraneous factors upon 
intelligence-test performance has been investigated by Botwinick (45) and by 
Furry & Baltes (119), who show that situational factors and instructional set may 
indeed markedly affect intellectual performance (also see 112, 186). Favorable 
performance differentials in intelligence test performance for measures of fluid 
ability were noted for adults engaging in physical fitness exercises (250). Similarly, 
favorable performance on WAIS measures was noted for individuals with relatively 
low blood pressure (343), those with a lesser incidence of systemic disease 
(344), and fewer negative neurophysiological indicators (337). The effect of phys- 
iological systems on intellectual performance is discussed in more detail by 
Eisdorfer (95) and Jarvik & Cohen (159). But when short-term longitudinal studies 
are conducted for reasonably heterogeneous groups, it is again clear that socio- 
cultural differences rather than age seem most important (274). 


Follow-up Studies 


A number of single cohort studies have again been reported upon during this 
period. In general they show little or no age decrement except for the period of 
very advanced age or during the terminal period shortly before the subject’s death 
where that variable has been controlled for. Worth mentioning in this group is a 
further follow-up on the Berkeley Guidance study at midlife (27, 141); further 
reports on the survivors of the Kallman-Jarvik twin study (41, 43); and a 
historically interesting follow-up of one of the early age-difference studies with 
the Babcock-Levy test (121). 


Cross-Sectional Studies 


A few cross-sectional studies are mentioned here because they investigate age 
differences with instruments not previously applied to aged populations. This 
group of studies investigated a Spanish version of the WAIS (130); the General 
Aptitude Test battery (117); the Draw-a-Man test (299); the Quick test (193); the 
Stroop Color Word Test (36); and Stein’s Symbol Gestalt and the Wechsler 
Memory Test (295). Witkin’s field dependence-independence measures finally 
have been investigated in relation to intelligence by Erlemeier, Schreiner & 
Schnurer (107). 


Concept Attainment and Problem Solving 


Research on concept attainment has recently taken a different turn because of a 
new focus of interest on the performance of older persons on Piagetian tasks. 
Although Piaget (114) postulates that the last stage of intellectual growth, formal 
operations, is reached at about the age of 12, several investigators have found 
consistent and reliable differences in the stage of conceptual development of 
young, young adult, and older subjects (237, 273, 313, 329). These studies seem to 
support the proposition that with age there is a curvilinear development on 
Piagetian tasks (143, 200, 273). It should be remembered, however, that although 
these investigators are postulating a developmental trend, their evidence is based 
on age differences. 

In attempts to find other factors which might account for age differences on 
Piagetian tasks, several variables have been investigated. Sex differences are not 
significant (273, 313) and neither is vocabulary (313). The effect of years of 
education is unclear since one study found a strong relationship (238) and the 
other did not (273). In a critical analysis, Riegel (257) contends that Piaget’s theory 
does not adequately represent the thoughts and emotions of mature and creative 
persons and therefore is inappropriate for describing them. He feels that use of the 
dialectic approach is more appropriate than Piaget’s schema to interpret qualita- 
tive aging phenomena. 

Problem solving has not received much attention recently. Older people nor- 
mally perform more poorly than do younger (87, 88, 219, 262, 353). Denney & 
Denney (87) suggest this may be the result of the failure of older people to base 
their questioning strategies upon any classification system. Strategies have also 
been observed on tasks requiring guessing future events. Sanford & Maule (279) 
found in two experiments that neither young nor old subjects adopted the best 
strategy, but the old performed more poorly than the young; yet Sanford, Griew & 
O’Donnell (278) found no age difference in performance on prediction tasks 
offering two or three alternatives. 


Learning 


Studies of learning and memory normally employ cross-sectional designs which, 
as stated above, seriously limit our understanding of the changes with age which 
occur in these processes. Since cohort differences have not been investigated, the 
particular contribution of this effect is presently unknown. Description is, there- 
fore, limited to age differences. 

Relatively few recent studies have been devoted to age differences in learning. 
Although learning ability had long been felt to decline with age, the current 
viewpoint holds that observed deficits in learning may be the result of 
performance differences rather than a function of a decreased ability to learn. One 
such performance factor was reported by Eisdorfer, Nowlin & Wilkie (97), who 
found that the learning of older subjects was improved by the administration of 
propranolol. The results were interpreted as supporting the hypothesis that the 
observed learning deficit in the aged is the result of heightened arousal of the 
autonomic nervous system. A more behavioral approach related to this hypothesis 
demonstrated that reinforcement was effective in decreasing omission errors and 
the number of trials to criterion (190), and that both social and objective rewards 
improve performance (154). 

Another factor often used to determine age differences in performance is stim- 
ulus pacing (cf 48). In general, the faster the stimulus pacing, the poorer the 
performance of older relative to younger persons. One study which held the 
anticipation interval constant and looked at the total time available for learning 
rather than at the number of trials to criterion did not find a significant pacing 
effect (173). When the anticipation interval was varied, differential age differences 
did emerge (212). When interference effects were also manipulated, however, the 
anticipation interval was no longer important (331), but interference effects im- 
paired the performance of older people (187, 331). Biofeedback conditioning, a 
task which may equate age groups in terms of interference effects since it is novel 
to both, was learned as well by old subjects as by young (349). Older subjects learn 
concrete items better than abstract items, and the strategies they use to learn them 
differ from those employed by young subjects (272). An inefficient learning set has 
been hypothesized to account for some of the differences (211). In addition, older 
subjects perform poorly on tasks requiring conceptual shifts (220, 221), although 
there are conflicting results (77). 


Memory 


Although conceptually different, learning and memory are inextricably linked— 
how can one remember something which has never been learned? As has been 
found with learning performance, the ability to remember seems to decline with 
age, although not under all conditions. Consequently, investigations have been 
undertaken to ascertain where the differences exist, and, perhaps more impor- 
tantly, to try to identify what factors may account for the observed decrement 
(cf 8). 

Age deficits are normally found on paced tasks (172, 318), tasks which demand a 
constant switching of attention (80), tasks which require a change of set (332), and 
tasks which require free recall rather than recognition (133). Since several studies 
have found that the search through memory seems to be much slower in older 
people (6, 7, 106), Anders, Fozard & Lillyquist (7) postulate that the less rapid 
access to memorized information handicaps the performance of older people and 
may be a factor in the observed decrements. 

Another approach to identify underlying mechanisms which may affect mem- 
ory processes as one ages involves perceptual processing. Harwood & Naylor 
(134) assessed the maximum rate of information acceptance of young and old 
subjects on three tasks of graded difficulty and found it to be lower in the elderly 
than in the young. Surwillo (314), investigating simple and choice reaction time, 
concluded that old age is accompanied by a slowing of information processing. 
Similarly, studies of dichotic listening show that with age there is a decrease in 
performance (302). Clark & Knowles (76) found an age-related decrease for both 
written and vocal recall and for the digits reported either first or second in 
sequence. This deficit was found predominantly in the material which had been 
presented to the left ear, but was not a function of a selective left-ear hearing loss, 
and may decrease the amount of information that reaches the short-term store and 
is thus available at output. 

Several studies have confirmed that the modality in which the stimulus is 
presented has a significant effect on observed memory deficits. Older people seem 
to be disproportionately impaired on visual as opposed to auditory recall tasks 
(108, 318). In an investigation also concerned with auditory storage, Laurence & 
Trotter (188) studied the effect of homophones, acoustically similar words, 
acoustically unrelated words, and blocked vs random listing of the items in a free 
recall test. Homophones were recalled best and acoustically similar words were 
recalled least by both young and old. Surprisingly, the elderly seemed to recall 
more words under the random list structure than under the blocked. These 
findings were interpreted to support the proposition that the elderly cannot make 
as efficient use of structural redundancy as can the young. 

One group of studies investigated the opportunity of organizing the material to 
be remembered. The position taken is that the observed age differences may be the 
result of inefficient organizational procedures by the elderly (124, 150-152). The 
role the nature of the task plays in influencing retention is still uncertain. Studies 
report differential age effects with a retroactive inhibition paradigm (332) and 
transfer tasks (330), although the latter study attributes this to better performance 
by the young subjects rather than poorer performance by the old. In another 
report, bright older people were subject to forgetting because of interference from 
contextual associations (51). Other studies found no differential interference 
effects as a function of age when subjects were matched for intelligence (169) and 
learning performance (210), although one study found no differential effect in two 
experiments until subjects were equated by a control criterion (321). Goulet (125) 
discussed the various methodological and theoretical issues involved. 

Meaningfulness of the material may result in age differences in memory. But 
Kinsbourne (172) did not find that the old performed differently from the young 
when the task was to recall items which varied in the sequential dependencies 
between successive letters. The performance of both age groups improved as the 
items more closely approximated written English. On the other hand, older sub- 
jects seemed to perform better as the meaning and familiarity of the stimulus 
increased (134, 147, 148). Taub (317) found both young and old subjects per- 
formed more poorly as the complexity of the stimulus arrangement increased, but 
that the decreases were similar for both, while Howell (147) found the older 
subjects performed differentially worse than young on more complex tasks. Re- 
peated testing or practice may improve the memory of older people (240), but 
there is no evidence that it aids the elderly any more than the young (319, 320). 
Attempts to improve learning (53) and memory (85) by manipulating biological 
mediators through the administration of yeast RNA have not been successful. 
However, the administration of pentylenetetrazol was found to lead to improve- 
ment (189); nevertheless, administration of drugs may have a placebo effect (218). 
Also, hyperoxygenation seems to lead to improvements (155, 156), while alcohol 
has differential age effects (both decremental and incremental) on a variety of 
tasks (345). 


PERCEPTION AND PSYCHOPHYSIOLOGY 


Sensation and Illusions 


It has long been known that hearing decreases with age (cf 79), although the effect 
is much smaller in certain cultures (268a). Recent research on hearing and aging is 
concerned with related processing abilities. Much of this research involves work 
with speech distortion. The aging auditory mechanism has decreased capacities in 
integrating and understanding distorted messages even in persons who have rela- 
tively normal hearing audiometrically (34, 207, 253, 342). Bergman (35) sug- 
gests this may be due to problems in time-related processing abilities. Another 
approach to comparing speech perceptions involves the use of the verbal trans- 
formation effect (illusory changes which occur while listening to recordings of 
repeated words). With age there is a loss of susceptibility to this effect (338), which 
is even less noticeable in deteriorated older people (232). Rees & Botwinick (254) 
measured the sensory and decision processes in an auditory signal detection task. 
They concluded that the magnitude of auditory deficits in old age may be over- 
estimated due to the greater cautiousness of the elderly, since the aged had higher 
criterion scores than the young, while the differences in detectability were not 
significant. The risk avoidance or greater cautiousness of the elderly has been 
found elsewhere (45) and may play a part in much of the observed performance 
decrements of older people. 
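
The distinction between detectability and criterion can be made concrete with a small computation. The sketch below, in Python, assumes the standard equal-variance Gaussian signal detection model and uses hypothetical hit and false-alarm rates; neither the rates nor the particular criterion index (c) are taken from the study cited, which may have used a different measure. It shows how two groups can have nearly the same sensitivity while one adopts a more cautious criterion and therefore reports fewer signals.

```python
from statistics import NormalDist

def sdt(hit_rate, false_alarm_rate):
    """Equal-variance Gaussian model: return (d_prime, criterion_c)."""
    z = NormalDist().inv_cdf            # inverse of the standard normal CDF
    z_hit, z_fa = z(hit_rate), z(false_alarm_rate)
    d_prime = z_hit - z_fa              # sensitivity (detectability)
    criterion = -0.5 * (z_hit + z_fa)   # larger value = more cautious
    return d_prime, criterion

# Hypothetical rates, chosen for illustration only.
young = sdt(hit_rate=0.84, false_alarm_rate=0.16)
old = sdt(hit_rate=0.69, false_alarm_rate=0.07)

print("young: d'=%.2f c=%.2f" % young)
print("old:   d'=%.2f c=%.2f" % old)
```

Judged by hit rate alone, the hypothetical older group would appear to hear less well; separating the two indices attributes the difference to response bias rather than to sensory loss.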

That visual function declines with age has also long been established (cf 79). 
Temporal characteristics seem to be a factor in visual perception since older 
people require longer exposure duration to identify forms even when matched 
with younger subjects on standard measures of visual acuity, and they have a 
longer critical duration over which time-intensity reciprocity holds for form iden- 
tification (105). A somewhat different approach to looking at temporal factors 
is described by Storandt (312), who tested recognition across visual fields. Finding 
that stimuli presented in the same visual field in both training and recognition 
were recalled faster than those presented in the opposite field for recognition, she 
concluded there is a less than perfect connection via the corpus callosum; however, 
the difference is the same for both young and old subjects. 

The other modalities have not received much recent attention. Contrary to 
popular opinion, elderly subjects did not differ from young or middle-aged adults 
in the perception of comfortable temperature (265). In a study of the aging voice, it 
was found that with age there is an increase in the mean vocal intensity for reading 
and speaking and a decrease in the overall reading rate and mean sentence reading 
rate (275). It was also found that age can be systematically identified from voice 
samples (309). 

Most recent research on visual illusions involved attempts to identify factors 
which may account for the observed age differences. These included cohort and 
time of measurement differences (101), inaccurate perceptions (99), verbal intel- 
ligence (100), and sensory deprivation (122). 


Central Nervous System 


The slowing of alpha rhythm has been frequently reported as a function of age 
(325, 336, 348). Studies of behavioral correlates of temporal lobe activity have 
conflicting results. Significant relationships are found on verbal-performance 
discrepancies of relatively healthy subjects (91, 337), but not when subjects are 
matched on such variables as age, socioeconomic status or education (337). On a 
conditioning task, measures of EEG have shown greater reactivity for the old than 
the young (310), although in cases of severe senile deterioration, diffuse slowing 
has been observed (231). The effects of EEG activity during sleep are much greater 
than any other age-related EEG characteristic (109). The percentage of rapid eye 
movement (REM) sleep remains stable among adults until around age 80, where it 
drops markedly (109) for both males and females (165). Correlations between 
cognitive and drive functioning and amounts of REM sleep in the aged were found 
(164). Such results suggest changes in CNS functioning are involved (109, 164). 

Average evoked responses from scalp recordings following the presentation of a 
stimulus show that the amplitudes of various portions of the wave form tend to 
change differentially with age for visual, somatosensory, and auditory stimulation 
(92, 203, 298). In a thorough review of this area, Thompson & Marsh (325) are led 
to the conclusion that individual differences such as intelligence, personality, and 
sex should be considered in research on evoked responses. The little available 
information on the effects of age on contingent negative variation shows no age 
differences in males (206). 


Autonomic Nervous System 


Few recent studies have been devoted to behavioral correlates and the aging 
autonomic nervous system. In general, the aged show less responsivity than the 
young, as measured by the galvanic skin response (215, 310) and cardiovascular 
responsivity (214, 310, 325), and greater reactivity as measured by free fatty acid 
mobilization (97). With one exception (186), most investigations of the arousal 
hypothesis using behavioral manipulations of levels of arousal in young and old 
subjects have not been successful in establishing differential age effects (37, 131, 
162). 


Reaction Time 


As with many other abilities, the slowing with age of response speed has long been 
recognized (cf 48). Proposing that perceptual deprivation would facilitate reaction 
time (RT) in old subjects by making them more responsive to sensory stimulation, 
Kemp (170) tested both young and old age groups but found no difference 
between experimental and control groups. On a task involving stimulus repetition 
on two-choice RT, examination of the frequency distributions led Waugh and 
associates (339) to conclude that the long latencies of the aged were a function of 
impaired psychomotor rather than decision-making efficiency. Studies designed to 
test the proposition that this loss is due mainly to sensory ability differences 
between the old and the young (46, 47, 49) suggest that the slowing is not attribut- 
able to sensory factors. Since previous results suggested motor factors were not 
important, Botwinick (47) concluded that central mechanisms underlie the slow- 
ing with age. Along this line, testing the hypothesis that EEG alpha rhythm is the 
master timing mechanism for behavior, Woodruff (348) used the biofeedback 
technique with both young and old subjects to manipulate brain wave frequency. 
She found that the slowing of the EEG alpha rhythm could not completely 
account for the slowing of RT with age (cf 324). 

Studies testing whether variations in heart rate could account for age-related 
slowing have found no relationship (50, 214, 325). But subjects with a personality 
type predisposing them to coronary heart disease had longer response latencies for 
both simple and choice RT than did subjects of a different personality type (1). 
Studies which show few age differences in response speed are quite infrequent, 
although among nonathletes, RT age differences are not observed (50). In addi- 
tion, age differences in RT disappear as a result of extensive practice, and this 
practice effect may show positive transfer to performance on ability tests (149). 


PERSONALITY 


The last few years have been particularly rich in reviews concerning our 
knowledge of adult personality development and have shown some progress in 
theory development, but they have been poverty stricken in contributions to our 
data base. Just as in the area of intelligence, we are discovering that much of the 
older literature suffers from fatal methodological flaws, that whole theories such as 
disengagement may have been built on methodological artifacts, but that the 
evidence which might help us build new models is accumulating very slowly. 


Reviews 


In her 1968 review of personality and aging, Chown (73) concluded that there is 
evidence for disengagement, but that it does not seem to occur voluntarily. She 
described conflicting evidence, some for stability and some for change, for many 
of the popular personality questionnaires and rating methods, but she deplored 
the absence of longitudinal studies which might help clarify issues. Much the same 
tale is told by Schaie & Marquette (293), who consider multivariate studies of 
personality in adulthood and have to conclude that there is as yet little under- 
standing of personality structure beyond the college age period. Neugarten 
(226-228) is somewhat more optimistic, but would like to see a trend towards 
expanding our network of naturalistic observations (see also Neugarten 225). She 
is particularly impressed with our lack of understanding of how some people age 
successfully, and concludes that while disengagement theory may indeed describe 
real phenomena, this theory seems inadequate in handling the behavior of suc- 
cessfully aging individuals. Other more limited reviews in the personality area 
have been provided from a life-span perspective for socialization and sex-role 
development (104), creativity and cognitive style (178), person perception research 
(217), continuity models of personality (198), and the social learning approach to 
personality development (4). 


Theory Development 


There is continuing discussion on the utility of the disengagement as against an 
activity model. Of particular interest here is evidence in cross-national studies that 
a positive relation between activity level and life satisfaction can be demonstrated 
in different urban centers (138). Similar findings are presented by Hess & Brad- 
shaw (140) for a heterogeneous group of subjects from high school to retirement 
age. On the other hand, there is some support for disengagement in the study of a 
VA domiciliary population (305). Another study suggests differentiating between 
psychological and physical disengagement (11). An attempt at incorporating cer- 
tain aspects of the disengagement model into a cognitive theory of personality is 
provided by Thomae (322), and at least one empirical study tries to relate disen- 
gagement to field-dependence (340). 

Of considerable interest is Kohlberg’s recent attempt to extend his theory of 
moral development into adulthood (179). This is the first systematic attempt since 
Erikson to extend personality theory beyond adolescence, and while the evidence 
thus far presented for the development of adult moral stages is slim (180, 183), it is 
likely that this forward thrust may lead to other ventures in the extension of 
theoretical models which in the past were not explicated beyond adolescence. 


Sequential Studies 


Methodologically more sophisticated studies are beginning to appear in the per- 
sonality literature. Woodruff & Birren (350) applied sequential methods to a 
follow-up of the California Personality Inventory first administered in 1944. They 
asked subjects to describe themselves in 1970 as they thought they were like in 
1944 and as they were like now. Their findings indicate that there were few 
objective changes over the 26 year period, but that subjects differed significantly in 
their subjective description of their personality at the earlier age. Further com- 
parison with a young control group in 1970 revealed substantial generation dif- 
ferences. Sequential methods have also been applied to a 14-year study of the trait 
of social responsibility (294) with apparent secular trends but no age trends, and 
to personality perceptual rigidity with similar results (290). 


Longitudinal Studies 


A number of personality variables have also been studied longitudinally in con- 
junction with studies already referred to in the section on intelligence. Much of the 
evidence from studies such as the Berkeley guidance study (cf 198) supports a 
continuity model. Much of this evidence, however, as yet reaches only into the 
middle years. A longitudinal study specifically oriented to uncover changes over 
the decade from 70 to 80 in a group of community elderly in rural Pennsylvania is 
reported in a monograph by Britton & Britton (54). These findings are frustrating 
because they do not show any clear trends, but rather suggest that over this age 
period at least, individual situational variables may be most important. On the 
matter of survival, it appears that there were no systematic relations with 
personality measures for men, but that the female survivors scored higher on 
measures reflecting activity; another blow to disengagement. Samples were quite 
small and measures many, so that this study must be viewed as generating research 
hypotheses rather than providing definitive answers to the question of whether or 
not continuity of personality extends into very old age. 


Personality Structure 


Several factor-analytic studies have examined adult personality structure in the 
aged, unfortunately most often with samples too small to permit sound conclu- 
sions. In this group are studies of personality and cognitive style (304) and of the 
MMPI (56). 

A somewhat different approach is taken by Goodwin & Schaie (123), who 
identified 13 personality factors analogous to those found by Cattell in a person- 
ality questionnaire administered previously to 991 subjects aged 21 to 75. Esti- 
mates of cross-sectional personality differences result in findings of stability for 
factors C, F, G, I, and M with unsystematic age trends for the remaining factors. A 
study of the Cattell 16 PF profiles of women psychologists belonging to three 
different generations showed profiles that were consistently more like men than 
women from the general population. There were some generational differences, 
with the younger cohort scoring higher in drive and self-reliance as well as being 
more impulsive (14). The relationship of anomie and cognitive structure (86) and 
of antecedent life styles and old age (103, 197, 328) have also been examined, with 
inconclusive results. 

Personality attributes and values of women past 35 years of age who were 
resuming their education were compared with women who were in the educa- 
tional process at the “normal” age. The older women were less identified with the 
feminist movement but more dominant than women currently in college (354) [cf 
also changes in timing of social roles as discussed by Neugarten & Datan (229)]. 


Miscellaneous Cross-Sectional Studies 


A number of cross-sectional studies which report age differences are mentioned 
for the record, even though again it is unclear whether their results should not 
more parsimoniously be interpreted as evidence of generational differences. Grant 
(129) examined age differences with the Tennessee self-sort across five age levels 
from 20 to 70 and found a curvilinear trend in positive self attribution. A study of 
achievement motivation on the TAT found no systematic age differences, while 
Klein (176) found old subjects conform more than young, unless they perceived 
themselves as competent in the task (177). A study of gambling and aging showed 
old people expected to lose while young expected to win, the interpretation of 
which was based upon locus of control (311). Two studies of time-perspective in 
the elderly found situational and specific ae een variables to be more 
important than the subject’s age (204, 303). 

Other topics for which age differences between young and old adults have been 
explored during this period include the relation of egocentrism and social inter- 
action (201), hypnotic susceptibility (213), the relationship between self-concept 
and reminiscence (137, 194), factors which affect life satisfaction and morale such 
as health (84, 115, 327), socioeconomic status (44, 84, 115, 327), age integrated 
communities (249), informal social interactions with friends (93, 192, 246, 316), 
future commitments (301), and personality variables (128, 191). Many of these 
studies found that these factors are more predictive of satisfaction than is age 
change per se. In an article discussing such criteria to predict adjustment and 
disengagement, Carp (67) argues against a simple composite measure of adjust- 
ment and suggests mapping the dimensions. Maddox (205) takes this argument a 
step further, arguing that the search for a single mode of adaptation to aging is 
both empirically and theoretically unjustified. Others suggest alternatives to the 
current theoretical models (184, 351). 


ATTITUDES 


Theoretical discussions of behavior changes in old age often suggest that such 
change—and in particular the withdrawal and disengagement phenomena—might 
be accounted for by negative changes in the self-concept of the aged, which in turn 
might be mediated by effects of societal stereotypes about the aged (cf 32). These 
issues have been investigated most intensively in the context of attitudes and 
perceptions of individuals across generations (cf 333). Britton & Britton (55) 
compared second, fourth, and sixth graders on their ability to distinguish between 
pictures of young, middle-aged and old. This discrimination was successful by 
fourth graders, and while younger stimuli were consistently seen as better-looking, 
older stimuli were seen as competent and understanding. Perception of grand- 
parents has also been related to age of child, the older the child the more distant is 
the grandparent seen (163). Attitudes of the old towards themselves and the young 
in turn seem to be related to the positiveness of life satisfaction for the elderly 
subject (82, 167). Studies in which older and younger subjects rate themselves and 
others tend to produce inferred age differences which are not congruent with 
descriptions of attitudes by the separate age samples of themselves, supporting the 
notion that societal attitudes may indeed be mediated by erroneous stereotypes (3, 
28, 52, 64, 236, 269). But these stereotypes may be in the process of change, since 
value orientations have been found to shift across generations (166, 239, 307) and 
differ in rural and urban areas (352). Objective versus perceived age differences in 
personality were compared for adolescents, middle-aged, and old subjects. The 
middle-aged were always correctly perceived but tended to misperceive the other 
two age groups (5) [also cf review of the person-perception literature (217)]. 

There is a limited amount of information upon the effects of attempting attitude 
modification in the young towards the aging. The use of imagery has been re- 
ported with positive results (71), while didactic lectures did not effect positive 
change in medical students (75). 

Most older people are willing to talk freely on the topic of death and express no 
fear about it (174, 264). Since it has been found that people think about death 
throughout the life span more frequently than they estimate they do, death would, 
nevertheless, still appear to be a “taboo” topic (66). While euthanasia has received 
much attention in the popular press, only one study reported the attitudes of older 
people on this topic. In general, the aged reject euthanasia, although some do 
favor omitting life-sustaining measures which merely prolong the timing of death 
(252). 


Sexuality 


Although with younger subjects there is now relative freedom to inquire about 
sexual relations, it is still fairly difficult to obtain such information from the aged 
(241). Nevertheless, some information about the sexual thoughts and activities of 
older people is available. In general, while there is a decline in sexual activity 
(245), and a reduction in thoughts involved with sexual matters (63), sex continues 
to play an important role in the lives of most older people (244), at least until age 
75 (334). Men report greater interest and activity than do women (244, 245), 
although for the women, this may be due to lack of a socially sanctioned partner 
(243). In addition, health probably plays a significant role (243, 334). 


Retirement 


Most of the literature on retirement is based upon factors influencing adjustment. 
Among the factors found to lead to better adjustment to retirement are good 
health (2, 102, 139, 323), adequate income (139, 248, 323), higher level of educa- 
tion (139), preretirement planning (139), and part-time work (139, 277). 

Contrary to general opinion, health generally improves in retirement (308). 
Several studies reported that those living in retirement communities had a higher 
morale than those living in age-integrated communities (57, 208, 306). 

It was suggested that only when work holds a central organizing position in a 
person’s life should attitudes toward work influence retirement attitudes (111). 
This seems particularly pertinent for academicians who were found to continue 
their professional involvements long after retirement (2, 270, 271) and for 
teachers, although to a lesser degree (10, 136). 
INTRODUCTION 


In this chapter the literature from June 1972 to June 1973 was to have been 
reviewed; however, the chapter on developmental psychology by Baer & Wright 
in the 1974 Annual Review of Psychology encompassed much of the research in the 
last half of 1972. In order to avoid redundancy, 1972 studies are reviewed here 
only when they are necessary to make a theoretical or methodological point, or are 
part of an ongoing controversy or series of studies, or were omitted from the 
previous chapter. Therefore, this is mainly a review of the literature in the first half 
of 1973, and because there were so few related papers on certain topics in that 
period, many of the areas traditionally covered have been omitted or given only 
cursory treatment. 

Space limitations also led to restrictions in the range of topics and in the 
inclusion of articles within topics. No attempt was made to review the work in the 
burgeoning fields of deviant behavior in children, mental retardation, gerontol- 
ogy, and behavior modification or intervention studies. In general, a preference 
was given to problems where considerable research activity had been engendered 
because of theoretical controversies or relevance to social issues. In addition, the 
authors tended to focus on areas in which they have particular interest and 
expertise and to neglect fields in which they felt less able to do a cogent review. 
This has led to the omission of entire topics which many developmental psychol- 
ogists would regard as important, to the reporting of only the most outstanding 
studies within a field, and sometimes to the exclusion of an interesting paper 
because it did not fit into the organization of the chapter. 


LEARNING, LANGUAGE, AND COGNITION 


Learning, language, and cognition are grouped within the same major section 
because many of the questions, theoretical concerns, and research strategies dis- 
cussed are common to all three. In many cases the assignment of studies to these 
three content areas appears arbitrary, since many studies labeled learning could 
equally as well have been labeled language or cognition and vice versa. 


Learning 


Three basic trends are found in the learning literature: first, a shift away from the 
traditional learning-conditioning orientation with its focus on demonstrations of 
the applicability of learning paradigms and explanations to new situations; 
second, a move toward closer examination of the influences of both the child’s 
capacities and the demand characteristics of the tasks themselves on learning; and 
third, an increasing interest in both rule learning and the child’s cognitive struc- 
tures which has resulted in a convergence of issues from learning, language 
development, and cognition. 

The shift away from the traditional learning/conditioning orientation can be 
seen both in Stevenson’s (255) critical analysis of theory and research in children’s 
learning and in the current experimental reports. It is clear that three of Steven- 
son’s departures from the traditional learning orientation—focusing on the infor- 
mational value of reinforcement, emphasizing the importance of attention and 
perception for learning, and critically reexamining the mediation literature—are 
paralleled in recent research activity. 

The informational value of reinforcers and changes in their functional rela- 
tionships with age are demonstrated in Spear & Spear’s (253) examination of the 
different effects of evaluative comments on younger and older children. They 
argued that evaluative comments directed younger children’s attention to the task 
at hand, while the older children’s increased ability to regulate their own 
performance allowed them to use evaluative comments to guide their behavior. 
Moreover, Feldstein (68) showed that uncertainty reduction has an incentive value of 
its own which is independent of tangible rewards. Along a somewhat different line, 
Masters & Mokros (177) examined the distracting and satiating effects of high and 
low magnitude incentives for children. Low magnitude incentives produced both 
more rapid acquisition and greater continued utilization of the acquired response. 
In general, their results suggested that distraction plays an important role in 
determining acquisition rate, while satiation plays an important role in determin- 
ing the continued utilization of the acquired response. 

The importance of attention and perception in children’s learning is revealed in 
two studies in which the impact of the spatial separation of stimulus components 
upon central and incidental learning was examined. Wheeler & Dusek (270) found 
that the spatial separation of components resulted in central learning increasing 
with age, between kindergarten and fifth grade, while incidental learning 
remained constant. Similarly, Hale & Piper (106) found differences in incidental 
learning between stimuli having integrated or spatially separated components. 
Incidental learning increased with age, between 8 and 12 years, when the stimulus 
components were integrated, but remained constant when the stimulus compo- 
nents were separated spatially. 

The widespread critical reexamination of the mediational literature is exem- 
plified by Cole & Medin’s (53) challenge of its basic focus. They questioned the 
need for additional demonstrations of the existence of mediation in children, 
urging instead that the future focus of mediation studies involve a more careful 
consideration of the conditions necessary for its occurrence. Although they agreed 
in general with Cole & Medin, Brown & Campione (36) differed in their assess- 
ment of the important problems in the area. In addition, they raised the issue of 
whether shift paradigms involved mediated learning or mediated transfer. 

Closer examination of the influences of both the child’s capacities and the 
demand characteristics of the tasks themselves on learning is reflected in the 
paired-associate learning literature. For example, Mallory (170) classified kin- 
dergarteners and second graders as “visualizers” or “verbalizers” and found su- 
perior paired-associate performance for the former under visual-elaboration 
conditions and for the latter under verbal-elaboration conditions. Similarly, a 
comparison of subject-provided and experimenter-provided imagery revealed 
superior paired-associate performance by both fourth and sixth graders when they 
provided their own imagery (51). The significant impact of the demand char- 
acteristics of the memory tasks themselves on children’s paired-associate per- 
formance was demonstrated by Homzie, Noyes & Lovelace (120), who found 
different patterns of results for picture pairs and picture-word pairs, depending 
upon whether recall or recognition tests were used. In general, these studies 
indicate that a child’s performance on a specific learning task can be altered 
radically by simple variations in instructions or in the task itself. 

An increasing interest in both rule learning and the child’s cognitive structures 
has resulted in several learning oriented studies of the effects of language on 
cognition. Since many of these studies were discussed in detail in the previous 
review by Baer & Wright (12), they are only listed here. These studies involved 
imitation and language behavior (45, 111, 271); training and conservation (57, 191, 
236, 245); and training and classification (46, 151, 199, 211, 243). 


Language 


The infant’s receptive language abilities, social class and ethnic influences on 
children’s language, and the relationship between language and cognition con- 
tinue to be topics of interest for many investigators. 

Most of the studies of infants’ receptive language abilities involved the effects of 
either acoustic or social stimuli on the infant’s discriminatory or vocalitory abil- 
ities. Using a sucking and habituation technique, Morse (198) demonstrated that 
40-54 day old infants were able to discriminate intonation and place of articula- 
tion cues. Focusing more on social cues, Barrett-Goldfarb & Whitehurst (14) 
found that stimulation with either father’s or mother’s voice suppressed the 
1-year-old’s spontaneous vocalizations, with greater suppression being obtained 
with the voice most preferred by the infant. Comparing social class differences, 
Tulkin (261) found greater differential responding to mother’s and stranger’s 
voices by middle class rather than working class 10-month-olds. 

Concern with social class and ethnic influences on children’s language was 
evident in this year’s literature as developmental psycholinguists continued wid- 
ening their domain to include children from a wide range of socioeconomic and 


100 HETHERINGTON & McINTYRE 


cultural backgrounds. Some of the original interests in so-called “disadvantaged” 
children arose from a suggestion that their language was in some sense linguisti- 
cally deficient and that this linguistic deficiency subsequently led to cognitive 
deficits. However, little support for this suggestion of linguistic deficiency has been 
found. For example, in a comparison of elementary school children from four 
subcultures, Hall, Turner & Russell (107) failed to find any evidence of black 
dialect interference with the comprehension of standard English. More recently, 
this original focus on linguistic deficits has been replaced by an interest in cultural 
and language differences between various social and ethnic groups. As Ginsburg 
(90) has noted, the failure of disadvantaged children in white, middle class school 
systems may be due to cultural and language differences rather than to defi- 
ciencies. The importance of the situation and demands of the task in the study of 
cultural differences in language can be seen in a study by Jones & McMillan (135), 
in which the speech of lower and middle class children in structured and natural 
settings was compared. Numerous differences were found. The greatest was in a 
structured setting when the children were asked to describe abstract events; the 
lower class children used shorter sentences and were more context bound. 

The concern about the relationship between language and cognition is also 
found in the study of the role of linguistic deficits in deaf children and their 
intellectual performance. Bornstein & Roy (28) criticized Furth’s (79) earlier 
conclusion that although deaf children are linguistically deficient, their thought 
processes are similar to those of hearing children, and therefore cognitive func- 
tioning must be explained without recourse to verbal processes. In essence, Born- 
stein & Roy rejected Furth’s basic assumption of linguistic deficiency in deaf 
children. In rebuttal, Furth (80) reaffirmed his conclusion and argued that deaf 
children are limited in their ability to handle linguistic rules. In an independent 
review, Bonvillian, Charrow & Nelson (25) found no evidence of intellectual 
deficiency in the deaf and argued that similar linguistic abilities underlie sign 
language and spoken language. In conclusion, they appealed for greater use of 
sign language in the education of the deaf. Some empirical support for the 
effective use of sign and total communication in preschool programs for 
hearing-impaired children is contained in the preliminary reports of a longitudinal 
study by Moores and his associates (195, 196). 

A more purely psycholinguistic orientation to the relationship between lan- 
guage and cognition was evident in the work of R. Brown and his students (37) and 
of Slobin (250). Here the focus is on the importance of the child’s stage of cognitive 
development and real world environment on his language development. This has 
led to a shift away from purely syntactic analyses of children’s early speech to an 
increasing interest in the child’s intended meaning or semantic content. This is also 
reflected in Brown’s use of case grammars in interpreting children’s language. In 
addition, both Brown and Slobin are concerned with the identification of linguistic 
universals in language development. 

In a study of the word acquisition of one- to two-year-olds, which clearly 
reflects the shift in interest from syntax to semantics, Nelson (202) found that the 
names of objects which show salient features of change (e.g. ball or car) are often 
learned first. She also found that children classified as referential or object- 
oriented speakers built larger vocabularies more quickly than children classified as 
expressive or self-oriented speakers. She interpreted her results in the context of an 
interaction model which interrelates the child’s preverbal concepts, his acquisition 
strategies, and parental-acceptance patterns. In an unrelated study, Wolff & Wolff 
(277) correlated speech and motor activity in 4- to 5-year-old children; the quality 
of speech output was correlated with gross motor activity, while the sophistication 
of verbal output was correlated with fine manipulative activity. These results are 
consistent with Luria’s (165, 166) earlier suggestions that after 4½ years of age the 
child is able to use internalized verbalizations to organize sequences of motor 
activity. 


Cognition 


The work on both traditional concept formation and Piagetian tasks continues to 
flourish. A substantial number of the concept formation studies were fairly tradi- 
tional in orientation, involving closer examinations of the effects of reward (66, 
99); perceptual salience (105, 186, 192); and negative instances (122, 143). 

Interest continued in the Piagetian literature in the relationship between com- 
pensation and conservation (57, 89, 96) and the order of emergence of various 
types of conservation (31, 32, 142). Since many of the compensation studies have 
been discussed by Baer & Wright (12), only a select few of the order of emergence 
studies are mentioned briefly here. Two of these studies (32, 140) supported 
Piaget’s previous findings. However, significant inconsistencies were reported by 
Brainerd (31), who found that transitivity emerges before conservation which, in 
turn, emerges before class inclusion. He uses these results to challenge Piaget’s 
analysis of seriation. 

In an innovative variation of the basic conservation task, Singer & Kornfield 
(246) asked 5-year-olds, 7-year-olds, and young adults to make verbal judgments 
of conservation as well as to choose which container of juice (or candy) they would 
like to consume. When the verbal index was used, only the 5-year-olds failed to 
conserve. However, with the consummatory index all three groups failed to con- 
serve; they chose the container which looked bigger. Along a somewhat different 
line, Fein (67) studied children’s judgments of causality and found that “mature” 
judgments of social causality appeared earlier than similar judgments for physical 
causality. Clearly, closer consideration should be given to the importance of 
so-called social stimuli for perceptual development, e.g. mother, a social stimulus, 
may be the first object to show permanence in the infant’s perceptual world. 


INDIVIDUAL DIFFERENCES IN INTELLECTUAL 
DEVELOPMENT AND ACHIEVEMENT 


Race, Social Class, and Intelligence 


It is now 5 years since Arthur Jensen wrote his controversial paper on the herita- 
bility and possible racial differences in IQ in the 1969 Harvard Educational Review. 
This led to a reactivation of the nature-nurture issue, attacks and counterattacks 
which were often scholarly, but sometimes emotional polemics, and an attempt to 
discredit or eliminate intelligence tests. In addition, behavior genetics became 
popular in the mass media, and the complex issues involved in the heritability and 
assessment of intelligence were simplified, sometimes in a misleading manner, for 
presentation to the public. That this controversy has had considerable impact is 
apparent in the number of schools ceasing to use intelligence tests and the legal 
and senatorial disputes about the acceptability of using test batteries as educa- 
tional or occupational screening devices, particularly in situations involving mi- 
nority groups. 

The conflict rather than abating is accelerating. At least 15 books and collections 
of readings dealing with the issue have been published in the past year. In 
addition, a vast array of papers appeared, dealing with the predictive validity of IQ 
tests, factors affecting racial and social class differences in IQ, and social, philo- 
sophical, and educational influences that have led to the concepts of intellectual 
functioning held and valued by Americans and much of Western society. 

In two volumes, Genetics and Education (132) and Educability and Group Dif- 
ferences (133), Jensen again presents his arguments that there are two qualita- 
tively different types of abilities, Level I or associative ability and Level II or 
cognitive ability. Level I ability is equally distributed across social classes and 
racial groups. He proposes that Level II ability, which is highly correlated 
with intelligence test performance and academic achievement, is differentially 
distributed and found less in lower class individuals and blacks. Thus Jensen avers 
that IQ differences between low and middle socioeconomic status children are not 
primarily due to cultural bias on tests or experiential factors but reflect genetically 
based differences in Level II intelligence, which involves abstract reasoning and 
mental transformations and manipulations rather than the simple reproduction of 
stimulus input. Genetics and Education is largely comprised of earlier published 
papers including the Harvard Educational Review paper; however, Jensen also 
reports some new research dealing with the relationship between Level I and 
Level II ability in what he calls primary and secondary retardation. He says it is 
important to distinguish between primary retardation, which involves a deficiency 
in both Levels I and II abilities, and secondary retardation, in which only Level II 
deficits are manifested. When retardation occurs in middle class children it is 
usually primary, whereas in lower class and black children secondary retardation 
is more frequent. Jensen goes on to say that IQ tests should be devised to test both 
Level I and II abilities so that secondary retardates who have difficulty in 
achieving in school, but who may later function adequately socially and econom- 
ically, can be distinguished from primary retardates who appear inadequate in 
most realms. Once secondary retardates are identified, educational programs utilizing
Level I abilities might be developed for them.

An example of the frequently found differences in interpretation of the same 
data in this area is exhibited by contrasting views of Jensen and Mercer (187, 188) 
in discussing the fact that traditional IQ tests do not predict economic, vocational, 
and social competence for members of the lower class, but do for the middle class. 
Whereas Jensen cites this as evidence for the differential distribution of Level II
abilities across classes and races, Mercer presents it as evidence of cultural bias in 
standard intelligence tests. 

In Educability and Group Differences Jensen extends his genetic argument and 
reviews findings which he believes eliminate environmental, educational, eco- 
nomic, motivational, and language factors, or cultural bias in tests, as adequate to 
explain the lower IQs and achievement of blacks. He notes that American Indians 
and Mexican Americans, who on many environmental factors are more disad- 
vantaged than blacks, perform better on nonverbal IQ tests and in academic 
achievement. He also adds that the differences between blacks and whites in IQ 
increase rather than decrease with rising socioeconomic level. 

Herrnstein, in his new book, IQ in the Meritocracy (115), reviews much of the
same literature on social class differences in IQ as Jensen did, and arrives at a 
theory which differs from, but is compatible with, that of Jensen. He speculates 
that IQ, education, occupational success, and social mobility are correlated. Since 
the brighter, more successful individuals tend to be upwardly mobile and people 
usually intermarry within their own class, a genetically based meritocracy is being 
formed. He believes that with great social mobility where people can find their 
natural level, the upper classes will have an increasingly greater genetically based 
intellectual capacity than the lower classes. 

A third very disparate point of view is presented by Jencks in his book, In- 
equality (131). Jencks says educational opportunities, cognitive skills, educational 
credentials, occupational status, income, and vocational satisfaction are unequally 
distributed. Both genetic and environmental factors play a role in the development 
of cognitive inequality, but cognitive differences as measured by IQ tests do not 
seem to be as strongly related to educational success as is family background. In 
addition, although educational attainment is related to occupational status, it has 
little effect on earnings within any given occupation. On the basis of these
correlational data, Jencks somehow reaches the rather pessimistic and questionable
conclusion that educational innovations or reform will have little effect on 
inequality; that equal access to education will have little effect since black and 
lower status individuals are less likely to take advantage of educational oppor- 
tunities; and finally that economic equality is the only solution to the problem, but 
that this is unlikely to occur in the near future. 

Other contributors to this discourse maintain that blacks and lower status 
individuals will not improve their situation until qualitative rather than quantita- 
tive views of the disparate forms that abilities, competencies, and personal worth 
can assume are accepted by society (70, 163). Looft (163) argues that the schools 
are largely responsible for transmitting and maintaining narrow cultural attitudes 
toward competence and achievement, and that educational institutions must fos- 
ter a pluralistic attitude and programs that encourage and value a heterogeneous 
group of abilities and life styles. 

Research evidence continues to accumulate that many environmental, motiva- 
tional, and personality factors may affect class and race differences in per- 
formance, including differential expectations and treatment of black and white 
children by teachers (229), greater assumption of personal responsibility following
failure by white than by black children (78, 81), and high situational test anxiety
in disadvantaged children (283). Findings on the effect of examiner race on the test
performance of black and white children continue to be inconsistent. France (75)
and Solkoff (251) found that a white examiner did not depress performance by 
black children; however, France but not Solkoff found a black examiner led to 
lower IQ scores in white children. 

The issue of the relative contribution of heredity and environment to race and 
social status differences in IQ is unlikely to be resolved with the restrictions 
necessary in human research and the limitations of currently available research 
methodology. We cannot take samples of black and white twins and rear them in 
controlled experimental conditions. As long as any methodologically feasible 
studies have partially flawed designs, the testing of alternate models to explain 
race and social status differences in intellectual performance will be inconclusive 
and the furor will continue. 


Sex Differences in Intellectual Development, Creativity and Achievement 


The study of sex differences shares some of the problems of the investigation of 
social class or racial differences in development. The topics are emotionally loaded 
and socially significant, and the research findings are open to alternative inter- 
pretations. The results of research on sex differences are even less consistent than 
those on race and social class. A popular subject this year has been sex differences 
in intellectual development, creativity, and achievement. Sex differences are 
found in cognitive functioning and in correlates of intellectual performance on a 
wide variety of tasks. 

On creativity tests, males between the ages of 7 and 10 were found to make a
larger proportion of original or unique responses, although girls gave a larger 
number of overall responses on any item (22, 125). In correlates of creativity boys 
and girls also were found to differ. In a study of the stability of creative ability 
between the fifth and tenth grade it was found that consistency for males was 
obtained only under conditions of group test administration and for females under 
a more personal, supportive, individual testing procedure (146). In addition, no 
correlation between IQ and creativity was obtained for girls at either age, but for 
boys by grade ten, IQ and creativity correlated. Schubert (238) also reported a 
correlation between intelligence and creativity in adult males, but this was most 
marked in the range of low intelligence test scores. However, in a study of 
kindergarten children no relationship between IQ and ideational creativity was
found for either boys or girls (247). These findings suggest an increasing conver- 
gence with age between IQ and creativity for boys but not for girls. 

In addition to sex differences in changing patterns of relationships for IQ and 
creativity over age, the personality attributes associated with creativity differ for 
boys and girls (247). Highly creative kindergarten boys are more open, expressive, 
exploratory, and playful. In contrast, both high creative-high IQ and low 
creative-low IQ girls are inhibited, lacking in self-confidence, and less effective with
peers.

McCall, Hogarty & Hurlburt (180), in a reanalysis of the longitudinal Fels
Institute data, reported sex differences in correlates of IQ with age. Girls’ infant 
test scores, in contrast to those of boys, show higher correlations with later IQ. In 
addition, infant vocalization predicts later mental performance for girls but not for 
boys, whereas frolicsome social activity in male but not in female infants is 
negatively related to later IQ scores. These results seem to support earlier findings 
of Bell, Weller & Waldrop (19) that low intensity, low reactive behavior in male 
neonates is related to advanced cognitive development at 2½ years of age.

In another analysis of older children drawn from the Fels longitudinal study it 
was reported that in school-aged children, activity was negatively associated with 
achievement striving in boys and positively in girls (15). There is considerable 
evidence from twin studies that social responsiveness and activity level may have a 
substantial heritable component (234, 273). Thus these genetically determined 
temperamental characteristics may be directly associated with poor intellectual 
performance and achievement striving in boys, or may elicit social feedback which 
leads to the development of these behaviors. Battle & Lacey (15) found that from 
the earliest years mothers responded negatively to the hyperactive behavior of 
their sons but not their daughters, and that their negative feelings were particu- 
larly related to critical evaluations of their cognitive development and achieve- 
ment. This was reflected in the self-doubt, anxiety, and feelings of incompetence in 
achievement reported by adolescent and adult males who had been hyperactive 
children. Thus hyperactivity in males seems to elicit negative feedback from 
mothers, which may mediate the development of attitudes and behaviors related 
to poor achievement and cognitive performance in sons. 

In addition to sex differences in correlates of early precursors of achievement 
and intellectual development, sex differences appear in relationships between 
motivation, attitudes, and achievement in the nursery school and later school years.
The early difficulties boys experience in school are frequently attributed to disparities
between cultural forces toward masculine sex role typing and the sex typed
characteristics of the school and teachers. The lack of male teachers to serve as role 
models and the encouragement by female teachers of conforming, unassertive 
behavior which is incongruous with the masculine sex role are said to result in boys 
experiencing stress and conflict in the early school years and coming to regard 
academic activities and objects as feminine. 

Primary teachers report that they approve of dependent behavior more than 
aggressive behavior in both boys and girls (159). Does this mean that they treat 
boys and girls differently because boys are more likely to show aggressive behav- 
ior? Biber, Miller & Dyer (23) find that female teachers are more likely to make 
instructional contacts with girls, but that about the same proportion of instructional
contacts with boys and girls involves positive reinforcement. This still means, in 
terms of absolute frequency of positive reinforcements, boys have fewer positive 
transactions with their female teachers than do girls. This finding is supported by a 
study of interactions of male and female teachers with preschool boys and girls 
(157) in which boys were found to receive more disapproval than girls from both 
male and female teachers. However, male teachers were much more approving of 
boys than were female teachers, and were more likely to assign boys leadership
positions. Boys in turn were more likely to affiliate themselves with male teachers 
and view themselves as being preferred to girls by male teachers, whereas girls saw 
themselves being preferred by all teachers. The authors conclude that male 
teachers in contrast to female teachers create classroom conditions which are more 
congenial to young boys and that except for reduced leadership positions this did 
not seem to be done at serious cost to the welfare of young girls. 

Peer influences also seem to play a role in boys’ attitudes toward school. Even 
with female teachers, boys who attend all-male classes are more likely to judge 
school-related reading items as a male activity than are boys who are in coeduca- 
tional classes (182). 

Although the school setting appears to have adverse effects for boys, it is by no 
means a constructive and stimulating one for girls. During the elementary and 
high school years the encouragement of sex typed behaviors in women begins to 
be manifested in attitudes and behavior which interfere with achievement. Nega- 
tive attitudes toward their own abilities and the development of passive behaviors 
which are incompatible with high achievement are inculcated in girls. Fifth grade 
girls underestimate the probability of their success (91) and are less likely than 
boys to persist in the face of sustained failure and to emphasize the role of effort in 
determining the outcome of their behavior (63). Girls appear to be characterized 
by learned helplessness. Although internality and acceptance of responsibility 
tend to be related to academic success for both boys and girls (52), boys who accept 
responsibility for their academic successes and girls who accept blame for their 
failures are most likely to be higher achievers. The relationship between internal- 
ity and academic performance is higher for boys than for girls, and the relation- 
ship between IQ and achievement is higher for girls than for boys (52). 

It is apparent that there are many sex differences in correlates of intellectual 
development and achievement throughout the course of development. These 
appear to be the result of complex transactions between genetic and constitutional 
factors, cultural forces involved in sex role typing, and differential responses by 
parents, teachers, and peers to the same characteristics in boys and girls (15, 23, 
157, 159). 


Cognitive Style 


Many of the studies on cognitive style which have appeared recently deal with 
problem solving strategies in children having different cognitive styles. Reflective 
in contrast to impulsive children show more mature problem solving strategies (3, 
10) and are more likely to consider several alternative hypotheses in problem 
solving and to use more efficient testing strategies (185). In addition, reflectives 
show different deployment of attention than impulsives. Although both reflectives 
and impulsives on the Matching Familiar Figures (MFF) test use the same basic
strategy of making comparisons between the standard and the variants, reflectives 
are more systematic and make a larger proportion of these comparisons (11). More 
selective attention in focusing on relevant aspects of a problem also is manifested 
by reflectives (108, 203).

Some similarities in deployment of attention between reflective and field independent
children have been found. Field independent in contrast to field dependent
children are less distracted by external cues in problem solving (174), and are
more able to attend to relevant cues in conservation tasks (73). They also attend 
relatively more to cues within the task rather than to social cues emitted by the
experimenter, such as the experimenter looking at or leaning toward the correct 
stimuli in a discrimination learning task (228). 

Attempts to relate other individual difference variables to cognitive style have 
not been entirely consistent. Mann (171) found that reflectives were more cautious 
in decision making than impulsives, but this was not supported in a study of risk 
taking by Kopfstein (147). 

Campbell (43) offers some evidence for an interaction between cognitive style in 
children and maternal behavior. Mothers of impulsive children have lower 
academic expectations for their children and do not intervene or structure learning 
situations as much as do mothers of either reflective or clinically diagnosed 
hyperactive children. Hyperactive children share some common characteristics
with impulsive children, such as short attention spans, greater field dependency, 
and an inability to inhibit motor movement (44, 112). It might thus be expected 
that the mother-child interactions of hyperactive and impulsive children would be 
similar. However, this is not the case. Mothers of hyperactive children in contrast 
to those of impulsive children provide more direct help, encouragement, and 
impulse control suggestions during difficult tasks and are more similar in their 
behaviors to mothers of reflective children (43). 


MEMORY 


The current year was very exciting for investigators of children’s memory. Only a 
few studies were directed primarily at demonstrating the correlation of chil- 
dren’s memory capacity with their age—an already well-established fact. Most of 
the studies reported were directed toward a clearer specification of the impact of 
children’s cognitive skills on their memory capabilities (which almost caused the 
authors to label this section “Applied Cognition” rather than “Memory”). Most of 
these studies can be classified into three groups which involve either children’s 
acquisition strategies, experimenter interventions, or semantic integration. The 
first two groups are closely related to Flavell’s (71) suggestions about the impor- 
tance of production deficiencies for children’s memory defects, while the third 
reflects an increasing interest in the influence of semantic integration, a cognitive 
phenomenon first described by Bransford & Franks (33), upon children’s recog- 
nition memory. 

Children’s Acquisition Strategies 

The importance of children’s use or nonuse of acquisition strategies for their 
subsequent memory performance was investigated by Flavell and his students.
They examined children’s abilities to use the cognitive skills necessary for directing
or monitoring their own memory behavior as well as their use of both study
and retrieval strategies. In one study, Appel et al (8) found no difference in
subsequent memory between groups of preschoolers who were told either to look 
at or to memorize a set of items. They concluded that the very young child is not 
aware of any difference between perceiving and remembering, and memorization 
itself is a cognitive skill which develops in children. The knowledge which an 
elementary school child possesses about his own memory abilities was assessed by 
Moynahan (200) with a task which required children to predict the relative ease 
with which sets of categorized or noncategorized items would be remembered. 
Such knowledge, termed metamemory by Tulving & Madigan (262), was evidenced
by third graders.

In a study directed specifically at the development of study strategies, Masur, 
McIntyre & Flavell (178) investigated the age at which children begin to use the 
simple strategy of selecting missed items for study. They found that this simple 
strategy was not used spontaneously by first graders but was used by third graders. 
Similarly, in another study directed at the development of the use of indirect 
retrieval cues, Ritter et al (225) found that such cues were not used spontaneously 
by 3½-year-olds but were used by 6½-year-olds. Evidently it cannot be assumed
that even the simplest study or retrieval strategies are possessed by very young 
children. 


Experimenter Interventions 


The impact of experimenter interventions upon children’s memory performance 
was examined by numerous researchers. Typically, these studies involved dem- 
onstrations of increments in memory performance due to rehearsal suggestions 
(104, 230, 272), clustering cues (144, 145, 161, 244), or other organizational cues 
(110). 


Semantic Integration 


A high degree of accuracy in young children’s recognition memory for pictures 
was demonstrated by Entwisle & Huggins (65) and Scott (240). However, a sub- 
stantial reduction in the recognition accuracy of elementary school children after 
they spontaneously integrated the relationships contained within semantically 
related pictures was demonstrated by Paris (212) and Peterson & Danner (215). 
Evidently the children did not retain information about individual pictures; in- 
stead, they based their recognition judgments on agreement with “semantic inte- 
grations.” The importance of this semantic integration for memory, especially if 
memory has a reconstructive basis, is aptly reflected in Paris’ conclusion that the 
“... comprehension of the way events are related, to a large degree, underlies 
memory” (212, p. 5). 


PERCEPTUAL-MOTOR DEVELOPMENT 


A large number of studies were directed toward attentional processes in infants. 
Most of these studies reflected methodological focuses as psychologists investi- 
gating infants continued to refine their techniques. Methodological improvements 
were suggested for the corneal reflection (231, 248, 249), visual preference (121),
habituation (179), and cardiac deceleration techniques (239). 

The rest of the perceptual-motor development literature reflected a predomi- 
nant interest in more traditional topics, e.g. depth perception (282); shape percep- 
tion (1, 56, 129, 264); selective attention (61); and illusions (158, 220, 269). 
However, in two other topics, motoric-graphic skills and spatial reference systems, 
an increasing interest in cognitive influences on perceptual-motor development 
was manifested. 


Motoric-Graphic Skills 


A number of studies involving skilled motor activity and children’s graphic abil- 
ities were reported. Bruner (38) and Koslowski & Bruner (149) traced the ac- 
quisition of early skilled motor actions. Bruner (38) suggested that the arousal of 
motoric intention calls into play minimally organized actions which are reshaped 
by feedback into skilled actions that require only a minimum of attention. Kos- 
lowski & Bruner (149) observed five successive age-related strategies used by 12- to 
24-month-olds as they learned to use a lever. The strategies involved recombining 
and “modularizing” already existing acts. 

Just as Bruner suggested skilled actions are made up of component acts which 
are organized into a rule-governed serial order, Goodnow & Levine (98) proposed 
treating graphic behavior as syntactic behavior and interpreting age changes in 
children’s drawing abilities in terms of shifting hierarchies of rules. 

Several other studies of children’s graphic skills were reported (64, 76, 95, 97, 
222, 258). In one of these studies, Freeman & Janikoun supported Piaget’s notions 
of mental image domination of young children’s drawings when they observed 
that young children draw what they know rather than what they see. Their results 
indicated a shift from this intellectual realism to a visual realism at about 7-8 years 
of age. 


Spatial Reference Systems 


Studies of the development of children’s spatial reference systems received con- 
siderable attention during the current year. Most noteworthy among these was a 
series of investigations by Pick and his associates. In an overview paper presented 
at the 1972 APA meetings, Pick (216) discussed three concepts—space, cognitive 
maps, and frames of reference—which are intimately related to the development 
of spatial reference systems, and suggested several innovative techniques for their 
exploration. Two of these techniques were developed more fully in later papers 
presented at the 1973 SRCD meetings. In one of these (217), 4- and 5-year-olds 
were asked a series of specific questions about the spatial layout of their rooms 
(e.g. Where is the bed located?) and their homes (e.g. What room is behind this 
wall?). The differences in performance observed between 4- and 5-year-olds were 
related to differences in the frames of reference which were characteristic of their 
spatial reference systems. In the other study (2), the incidental and intentional 
memory of 3-5 and 7-8 year olds for specific environmental locations were
compared. The results indicated that the older children spontaneously tagged 
events with their environmental locations and made use of a frame of reference
which was relatively independent of the location of specific objects. In slightly 
tangential but related papers presented at the same SRCD meetings, Warren (267) 
discussed the role played by early vision and blindness in the development of 
spatial reference systems, and Yonas (281) discussed the relationship between 
shading cues for depth and the development of spatial reference systems. 


ROLE-TAKING AND COMMUNICATION SKILLS 


The current year was highlighted by several theoretical and methodological ad- 
vances in role-taking research. 


Theoretical Advances 


In view of the recent debate between Chandler & Greenspan (47) and Borke (27) 
concerning the role-taking abilities of young children, the work of Flavell and his 
students is of particular interest. Recall that Chandler & Greenspan argued, in sup- 
port of Piaget, that young children are primarily egocentric and that role-taking 
abilities do not emerge until early adolescence. Moreover, they suggested that 
Borke’s (26) report of emotional role-taking abilities in young children represents 
“ersatz egocentrism” (i.e. it results from predictions based upon projection, iden- 
tification, and stereotyping). To the contrary, Borke (27) argued that children as 
young as 3 years of age possess emotional role-taking abilities and that previous 
failures to find evidence supporting her position were due to the limited response 
capacities of very young children. Moreover, she suggested that emotional role- 
taking (empathetic awareness) originates in infancy and develops in a hierarchical 
stage-like fashion. 

Independent support for Borke’s position appeared in Flavell’s (72) important 
paper which contained two major theoretical contributions: a general sequential 
model for interpersonal inference, and a more specific sequential model for the 
development of perceptual role-taking abilities during early childhood. The 
interpersonal inference model contained four sequential subcomponents: exis- 
tence—the basic knowledge that others possess covert psychological processes; 
need—an understanding that the present situation requires inferences about 
others’ psychological processes; inference—the actual inferential or computational
activity itself involved in generating estimates of others’ psychological
processes; and application—the behavior which follows as a consequence of the 
inferential activity. The model for the development of perceptual role-taking 
abilities in early childhood involved four stage-like levels: Level 0—the child has 
no representational (symbolic) knowledge that others have visual experiences; 
Level 1—the child can represent that others see objects; Level 2—the child can 
represent qualitatively the various perspectives of objects seen by others; Level 
3—the child can represent quantitatively the retinal image another has of a visual
array. Empirical support for the existence of these levels was reported by Ma- 
sangkay et al (172) and McIntyre et al (184), who used very simple role-taking 
tasks to demonstrate Level 1 abilities in 3-year-olds and Level 2 abilities in
5-year-olds. 

Early role-taking ability was also found by Shatz & Gelman (242), who report 
that 4-year-olds, who perform poorly on egocentrism tests, are capable of adjust- 
ing their speech for different listeners. In general, the speech which 4-year-olds 
directed toward 2-year-olds was simpler, shorter, and contained more attentional 
utterances than the speech they directed toward adults. Along similar lines, 
Gleason (92) looked at how children and adults change the style of their com- 
munications to fit listeners of various ages. Typically, the style changes of very 
young children involve jabbering at siblings and parents, whining at parents or 
parental figures, and falling silent in the presence of adults. However, 4- to 
8-year-olds use four basic communicative styles: baby-talk, directed toward 
babies; peer-group colloquial, directed toward siblings and friends; formal, di- 
rected toward adults and strangers; and socialization language, directed toward 
somewhat younger children. Both the amount and the quality of children’s style 
switching increase with age.

In general, the evidence obtained from both role-taking and communication 
studies indicated that some rudimentary role-taking skills are possessed by very 
young children. Evidently Piaget’s conclusion that young children are basically 
egocentric and role-taking skills do not develop until late middle childhood or 
early adolescence was influenced by the complexity of the tasks he used. It is clear 
from the recent work with much simpler tasks that his view needs to be questioned. 
Moreover, it is also apparent that his conception of the relationship between 
role-taking and communication skills needs to be reconsidered. However, this is 
only one part of a more general need to examine closely the interrelationships 
between the various role-taking, communication, and cognitive skills of young 
children to determine whether they are reflections of a change in an underlying
cognitive competency (e.g. egocentrism), as Piaget would suggest, or whether each 
has an ontogenetic history of its own. Rubin (227) examined this general question 
by intercorrelating the performances of elementary school children on spatial, 
cognitive, recursive thinking, and communicative role-taking tasks. Three of the 
role-taking tasks—spatial, recursive thinking, and communicative—were signifi- 
cantly intercorrelated and factor analysis revealed a general egocentrism factor. In 
another investigation of this same question, Van Lieshout et al (266) found similar 
results using younger children (3-5 year olds) and different role-taking tasks.
Although both of these studies suggest that there is a cognitive factor of egocen- 
trism associated with a number of role-taking tasks, they cannot be regarded as 
conclusive given the lack of supporting studies and the general lack of knowledge
about the difficulty and reliability of the role-taking tasks themselves. More
studies of this question are definitely needed. 

In contrast to the predominant focus on the role-taking abilities of young 
children, Looft (164) suggested that more attention should be given to egocentrism 
across the entire life-span. He suggested that many of the characteristics commonly
associated with aging (e.g. rigidity, regression, and psychosocial disengagement)
are manifestations of or contributions to an increase in egocentrism in
later life. 


Methodological Advances 


The separation of perceptual role-taking tasks into rotation, perspective, and 
perspective-move tasks was suggested by Huttenlocher & Presson (127). Rotation 
tasks require the child to predict the future appearance of an array of objects that 
will be rotated, whereas perspective and perspective-move tasks require the child 
to predict the appearance of a fixed array of objects after either someone else 
(perspective) or the child himself (perspective-move) moves with respect to it. 
Huttenlocher & Presson found the perspective task to be much more difficult than 
either the rotation or the perspective-move task. They interpreted their results in 
terms of psychological restrictions on imagination, i.e. the child’s present spatial 
position makes it difficult for him to imagine himself at another position but it 
does not hinder his ability to imagine the array in other positions. 

Several other intriguing tasks were reported during the year. Overton & Jackson 
(208) asked 3-, 4-, 6-, and 8-year-olds to perform fairly commonplace actions with 
imagined objects (e.g. pretend you are combing your hair, drinking from a cup, 
etc). A clear developmental pattern indicating the importance of motor activity for 
younger children’s imagination emerged: younger children evidenced difficulty 
with the task; somewhat older children completed the task by substituting part of 
their body for the imagined object; and older children performed the actions with 
imagined objects without difficulty. Another developmental pattern emerged 
when Wolman, Lewis & King (278) asked elementary school children to describe 
their emotions. They found that the older children tended to refer to their emo- 
tions as being more toward the interior of their bodies and more localized in 
specific regions, especially the head-brain region. 


SOCIAL AND PERSONALITY DEVELOPMENT 


Several trends in the study of personality and social development can be detected. 
First, the impact of cognitive psychology is apparent. The interaction of cognitive 
skills, perception, attention, and judgments with social behavior is the focus of 
many studies. Second, there is an emphasis on the cultural relativism of behavior. 
Investigators are less prone to label divergent behaviors among groups as deficits 
and more likely to regard them as differences. Third, studies of the family, 
particularly of parent-child interaction, are proliferating. Finally, concern with 
social relevance has led to a modest movement toward the use of more naturalistic 
methods and an uneasy questioning of the ecological validity of the results of that 
most cherished cornerstone of psychology, the laboratory experiment. 


Modeling 


The topic in social development which continues to be the focus of the most 
research is that of modeling. Over 40 modeling studies were published in the last 
year. 

A well-known behavior modifier once said it was time research on the application 
of operant methods passed beyond atheoretical “Whoopee!” studies 
where with each new behavior conditioned the investigator yelled, “Whoopee! 
We can condition it!” It now seems clear that under the right circumstances 
children will imitate almost anything. Whoopee! Children will imitate 
moral judgments (140, 219, 263). Whoopee! Children will imitate altruistic beha- 
vior (189, 197, 280). Whoopee! Children will imitate resistance and yielding to 
temptation (226, 275, 276). Whoopee! Children will imitate aggressive behavior 
(54, 58, 102, 204) and self-reward (152), and social interactions (205), and per- 
ceptual judgments (59), and vocal and linguistic responses (111, 123), and concept 
formation (284, 285), and facial expressions (109), and commodity preferences 
(254), and teachers’ preferences and behaviors (69, 218), and picture preferences 
(130), and button pressing (260), and cognitive style, and Whoopee! Almost 
anything! 

It would be unfair to say that all of the above studies were just “Whoopee!” 
studies; some are well-designed studies that deal with important issues. However, 
it is unfortunate that few are developmental studies which look at age-related 
determinants and changes in imitation, and few are part of a larger programmatic 
series designed to clarify theoretical issues. 

Three issues appear to be the focus of much of the current research on imitation: 
the role of cognitive and motivational factors in imitation; the relative efficacy and 
interactions of imitation and reinforcement in modifying behavior (123, 205); and 
the generalization of imitative responses across situations and from verbal to 
behavioral responses. Only a few representative studies which seem the best 
within each topic will be presented. The research on generalization will be dis- 
cussed later under the topics of moral development and aggression which have 
been the behaviors of interest in most of these studies. 

Several of the more interesting studies deal with the role of informational cues 
in mediating imitative behavior. The presence of a common dimension (254) and 
provision of a response rule (285) increase imitative concept learning. Bandura 
(13) and Liebert & Fernandez (162) have proposed that vicarious reinforcement 
can be regarded as a type of informational cue and that it increases imitation 
because it informs subjects of consequences of behaving similarly to a model. 
Thelen et al (260) support this position by demonstrating that vicarious rein- 
forcement enhances imitation when subjects expect to perform the model’s behav- 
ior but does not when there is no expectancy to perform. 

Zigler & Yando (284) stress the interaction of cognitive and motivational factors 
in imitation in an excellent study of younger and older institutionalized and 
noninstitutionalized children. The investigators propose that as children develop 
and become more cognitively proficient, their effectance motivation leads them to 
rely more on their own cognitive processes and less on cues from others in problem 
solving. However, although imitation will decrease with age under most circum- 
stances, ambiguous situations or problems beyond their cognitive ability will 
motivate children to increase imitative responses. 

Studies of the effects of a model’s nurturance on directing attention toward 
characteristics of the model and motivating the child to imitate continue to yield 
inconsistent results, although two well-designed studies by Yarrow and her colleagues 
make major contributions to clarifying the conditions under which nurturance 
may facilitate imitation (279, 280). These studies differ from most previous 
laboratory studies of the effects of the model’s nurturance on imitation by having 
the children interact with nurturant or non-nurturant models playing the role of 
teachers in a nursery school setting over a period of several weeks. Thus the 
contact is more sustained and the setting and relationship with the child are more 
naturalistic than in most experimental studies. Under these conditions it was 
found that nurturant models were more likely to facilitate the imitation of nur- 
turant but not of non-nurturant behaviors by children, whereas non-nurturant 
models facilitated the imitation of aggressive behaviors (279). These studies make 
it clear that imitation is selective and that different characteristics of a model 
facilitate the imitation by children of different behaviors exhibited by the model. 
In addition, the models’ nurturance increased delayed imitation (279) and imita- 
tion of the models’ helping behavior in a new situation outside of the nursery 
school (280). These results in conjunction with those of previous studies using 
parents as models where warm parents were found to increase imitation of inci- 
dental behaviors in children, especially in daughters (117, 201), or studies involv- 
ing extended nurturant interactions (113) by peers, suggest that sustained warm 
relationships such as those with parents, caretakers, or peers do enhance imitative 
behavior in children. In contrast, short-term laboratory warmth manipulations 
with strange experimenters, preceding a modeling task, often fail to do so. Such 
disparities are leading investigators to question the generalizability of such lab- 
oratory studies of imitation and to ask whether these studies show what children 
can do rather than what they actually do in more naturalistic situations. 

Although in laboratory studies nurturance of the model may not increase 
imitative responses, there is evidence that non-nurturance may actually suppress 
imitative behavior. In accord with earlier findings of Patterson, Littman & Brown 
(213), a recent study by Jeffrey, Hartmann & Gelfand (130) found that second and 
third grade children significantly mismatched their responses on a picture prefer- 
ence test with those of a nonresponsive model. This problem of negative imitation 
is one with great practical and theoretical significance and on which there is little 
research. 


Moral Development and Self-Control 


Over the past decade two marked changes in the study of moral development have 
occurred. First, there has been some convergence of the approaches of social 
learning theorists and cognitive psychologists. Rather than the former focusing on 
moral behavior and the latter on judgments of morality, a current interest in the 
interaction between ethical behavior and reasoning is apparent. Second, there is 
increasing interest in studies on prosocial behavior and a waning of interest in 
studies on negative aspects of behavior. There are few studies of aggression, and 
many of helping, sharing, and cooperating. This seems to be part of a general trend 
toward viewing the child positively in terms of his competencies rather than 
negatively in terms of his deficiencies. 


MORAL JUDGMENTS Studies of children’s judgments of moral dilemmas have 
focused on methodological issues, modification of ethical judgments, and the 
relationship of judgments to behavior. 

Methodological controversies are pervasive in this area. It has been found that 
both the type and content of moral dilemmas (41, 55, 133a, 134) and mode of 
presentation of the dilemmas (48) influence the ethical judgments of children. 

The use of stories involving positive intentions, causal information about a 
wrongdoing and subsequent misfortune, and the videotaped rather than tradi- 
tional verbal presentation of moral dilemmas result in more intentionality re- 
sponses, and in intentionality being expressed at a younger age than 8 or 9 when it 
is usually first found by investigators using variations of Piaget’s original meth- 
odology. These findings suggest that in evaluating or comparing the results of 
studies of moral judgments and reasoning, both the content and mode of presen- 
tation of dilemmas must be considered. 

The second large group of studies in moral judgments deals with the effective- 
ness of models in altering the ethical judgments of children. In the 1971 Annual 
Review of Psychology, Hartup & Yonas concluded that in spite of the many 
arguments advanced by cognitively oriented psychologists against the effective- 
ness of modeling in altering moral judgments, the facts indicate that “modeling is 
implicated as a process in the development of children’s moral thinking even 
though the extent to which modeling is involved in the development of moral 
judgments and the nature of its interaction with other aspects of the child’s 
cognitive functioning are less clear” (114, p. 374). 

Little new evidence has been garnered to alter that conclusion. Some of the 
conflicting findings in this area appear to be due to the wide variations in the 
modeling procedures used, the particular aspect of moral thinking or behavior 
studied, and the interval between the modeling procedure and assessment of 
change. 

In most studies children are asked to judge the seriousness of moral dilemmas 
and to give reasons for their judgments. Keasey (140) suggests that part of the 
confusion in findings results from not distinguishing between shifts in the moral 
level of the opinions stated and that of the reasons offered to justify the opinions. 
In a study using three modeling conditions, varying opinions with or without 
reasoning at the subjects’ own level of moral development or one stage above, the 
effects of the model’s opinions and reasoning varied over time, and changes in 
moral opinions and in reasoning by the subjects were found to be unrelated. 

Brainerd (30) argued that judgments on Piagetian tasks are more appropriate 
than explanations as bases for inferences about cognitive structures since they are 
less vulnerable to Type II errors. Reese & Schack (224) responded that “a more 
reasonable conclusion is that explanations may yield more Type II errors, but 
judgments yield more Type I errors; hence a choice between them depends upon 
which type of error is considered more serious. It is recommended that both be 
used, and their interrelationship studied” (224, p 67). 

Two investigations appeared of the generalizability to behavior of the effects of 
being exposed to the ethical judgments and reasoning of the models. Turiel & 
Rothman (263) found that the model’s level of reasoning had little effect on the 
subject’s subsequent reasoning, and generalizability from the model’s reasoning to 
the subject’s decision to continue punishing another person in a Milgram-type 
obedience situation showed only stage restricted changes. The model’s reasoning 
at the stage above led to an upward shift in behavioral choice for stage 4 but not 
stage 2 and 3 subjects. Prentice (219) found that live and symbolic modeling of 
intentionality was effective in increasing intentionality responses in delinquent 
males but that the effects did not generalize to a nonmanipulated aspect of moral 
judgment, moral relativism, or to delinquent behavior during the following 9 
months. 

In view of the continuing findings of no (252) or extremely modest relationships 
(60) between verbally stated values and behavior, and of the stage specificity of 
such relationships (232, 263), it seems unduly optimistic to expect the brief 
modeling of verbalized standards to lead to marked or enduring changes in 
behavior. The lack of effectiveness of verbalized standards of a model in changing 
children’s ethical behavior is consistent with the results of laboratory studies of the 
effects of models on the self-reward or sharing and helping behavior of children, 
which will be reported in the next section. The actions but not the verbalizations of 
the model are usually found to be effective in altering children’s prosocial behav- 
ior. 


PROSOCIAL BEHAVIOR The accelerating interest in prosocial behavior is indicated 
by the devotion of an entire issue of the Journal of Social Issues (274) to positive 
forms of social behavior. Most of the research in this area has focused on sharing, 
helping and cooperative behavior. 

In studies involving subject characteristics it has been shown that older children 
are more likely to share than younger children, and that normal children are more 
likely to cooperate than retarded children (169). In addition, cultural factors affect 
prosocial behavior. Middle class boys are more likely to share on the basis of 
reciprocity than are lower class boys who tend to share on the basis of a social 
responsibility norm (62). Urban Canadian children cooperate less than rural 
Blackfoot Indians (190). Anglo-American children are more rivalrous than 
Mexican-American children, and this cultural difference increases with age for 
boys but not for girls (136). The great emphasis on achievement and indepen- 
dent striving, particularly for boys, in middle class, urban, Anglo cultures inter- 
feres with cooperative relations with others. 

Affect is assumed to play an important role in altruistic behavior (9). When 
affect was directly manipulated by having 7- to 8-year-old children think sad or 
happy thoughts, positive affect was found to increase and negative affect to 
decrease sharing (194). Investigators have attributed an increase in both altruistic 
and self-rewarding behavior following success to the positive affect associated with 
the success experience (40, 150). However, the proposal by Underwood, Moore & 
Rosenhan (265) that self-gratification and other gratification are positively corre- 
lated is difficult to support with their finding that thinking sad thoughts decreased 
altruism but increased self-gratification. 

In addition, in preschool children self-reward (176) and sharing with others 
(153) are apparently unrelated to the child’s perception of the adequacy of his 
performance on a task. By age 7, however, self-reward is more likely to follow a 
success than a failure experience (175). 

In a review of the literature on prosocial behavior in children, Bryan (39) 
concludes that symbolic modeling and verbalization of altruistic standards are 
most likely to increase prosocial verbalizations, whereas altruistic behavior in 
models increases prosocial behavior. In the rare cases where verbalization has 
been shown to have some effect on behaviors, such as resistance to deviation in 
children, it has been much less effective and enduring than that of behavioral 
models (276). However, even behavioral models are more effective in eliciting 
deviations and disinhibitory effects than in increasing self-control or altruism (39, 
118, 226, 275). 

Three naturalistic studies show the effects of modeling on prosocial behaviors 
(77, 279, 280). In the previously cited studies conducted in a nursery school setting, 
Yarrow and her colleagues (279, 280) found that nurturant behavior in contrast to 
symbolic modeling of nurturance was most effective in eliciting “live” altruism in 
preschool children. Only nurturant caretakers who had demonstrated 
helping in both symbolic and actual distress situations facilitated long-term 
altruistic behavior in children. The Yarrow, Scott & Waxler study (280) is 
particularly impressive since the situational generalization of modeling was as- 
sessed in an unfamiliar home setting which differed completely from the training 
situation in the school. 

In a third naturalistic study (77) in which lower and middle class nursery school 
children were shown aggressive, prosocial, or neutral films over a 9-week period, 
aggressive films increased disobedience and the inability to tolerate delay, and 
increased aggressive behavior in children who were already above the mean on 
aggression. Whereas prosocial films increased altruistic behavior only in low 
socioeconomic status children, they improved task persistence, rule obedience, and 
tolerance for delay of gratifications for both lower and middle class children. In 
this study both behavioral modeling and verbalization of standards were present 
in the films, and the relative effectiveness of the two cannot be separated. 

Many studies have demonstrated that in addition to the models’ behavior the 
consequences of the models’ actions influence self and other reward by the viewer 
(39). Sharing is facilitated not only by seeing a model punished for a refusal to 
share, but also by viewing noncontingent vicarious punishment of the model 
which has a generalized inhibitory effect on antisocial behaviors of the viewer 
(197). 


AGGRESSION The scattered studies of childhood aggression seem to show no 
particular focus or theoretical concerns. One wonders if the dearth of significant 
studies on this problem is attributable to a feeling by many investigators that the 
important questions about aggression have already been answered. 

As might be expected, studies on the imitation of aggression continue to appear. 
Evaluations of the models’ behavior by co-observers affected the aggressive responses 
of 10-year-old children more than 5-year-old children when they were 
alone (102). In addition, for younger but not older subjects, temporal separation 
occurring between an aggressive act and punishment of the model obscures the 
message that aggression is negatively motivated and leads to unpleasant conse- 
quences (54). Hence the delayed punishment of the model was not effective in 
inhibiting the imitation of aggressive responses by the younger children. Chil- 
dren’s destructive play increases and constructive play decreases following realis- 
tically but not stylistically filmed aggression (204). The importance of sex of model 
and sex appropriateness of behavior is again demonstrated in a study of emo- 
tionally disturbed boys by Davids (58). Male aggressive models were more in- 
fluential in increasing the boys’ aggressive behavior, but female nonaggressive 
models were most effective in increasing nonaggressive behaviors. Female models 
were most effective in modifying verbalizations, with the nonaggressive female 
model increasing nonaggressive verbalization, and the aggressive female de- 
creasing nonaggressive verbalizations. 

In a study of 7-, 9-, and 12-year-old children, Shantz & Voydanoff (241) found 
that the tendency to respond with less retaliatory aggression to accidental than 
intentional provocation, and to verbal in contrast to physical provocations, in- 
creased with age. 

An unusual study of aggression was one dealing with “mobbing” behavior in 
Swedish school boys in which one or several boys systematically harass and use 
violence against another (206). Great stability in the behaviors of the bullies and 
the whipping boys (scapegoats) was found over a one-year period. Bullies were 
physically strong, verbally and physically aggressive, and had poor self-control; 
however, they were popular with their peers, particularly with girls. This correla- 
tion between externalizing aggression and popularity in boys was also found in a 
study of Guatemalan children (4), and has previously been reported in studies of 
lower class American boys. In contrast, whipping boys tended to be physically 
weak, fearful, and unpopular, and to have low self-esteem. Both bullies and whipping 
boys seemed to have more deviant parental relationships than did well-adjusted 
boys. Bullies were alienated from their fathers and felt their parents were lacking 
in affection. Whipping boys had close relationships with their parents, but the 
parents were reported to be indulgent, overprotective, and overanxious with their 
sons. 


PUNISHMENT AND SELF-CONTROL It is difficult to draw any general conclusions 
from these investigations of punishment since they study different forms of pun- 
ishment on a variety of children’s responses with subjects of varying ages. A high 
intensity buzzer appears to be the most popular form of punishment used in 
laboratory studies with children, and three studies involving noxious noise are 
reviewed below. 

Consistent punishment by a noxious noise results in faster inhibition of the 
response of hitting a Bobo doll by 8- to 10-year-old boys than does inconsistent 
punishment (210), and punishment suppressed this response more rapidly than a 
no-punishment extinction procedure. 

In contrast, Grusec & Ezrin (103) found that reinforcement for self-criticism 
increased self-critical responses, whereas the technique of punishment, either love 
withdrawal or withdrawal of material resources in combination with a buzzer, had 
no effect. The difference in results in these two studies may be attributable to the 
differences in the dependent behaviors studied. 

LaVoie (154), using mothers and fathers as punitive agents, found that both a 
buzzer and a rationale were effective in increasing resistance to deviation in 
adolescent boys, although a rationale was more effective. The use of a rationale 
appears to be more effective in reducing deviant behavior in adolescents than with 
the younger children used in previous studies (49, 50, 155, 209). LaVoie (155) 
agrees with earlier reports by Hoffman & Saltzstein (119) in finding that mothers 
were more effective as punitive agents than were fathers and that mothers may 
play a more important role than fathers in the acquisition of self-control. He 
suggests that this may be because after the adolescent has transgressed, the mother 
is more likely than the father to let the child explain his transgression and to 
modify her discipline in response to his explanation. The punishment from the 
mother therefore might be more justifiable from the son’s point of view than 
punishment from the less responsive and less communicative father. This finding 
differs from that of a laboratory analog study of discipline which used a very 
similar resistance to temptation task and found that latency until deviation in 
second and third grade children was more affected by the commands of a dom- 
inant male than female agent. No other effects of sex or nurturance of agent 
were obtained, but dominant adults giving either permissive or prohibitive com- 
mands were more effective than nondominant agents in shaping self-control (257). 
This disparity in the effectiveness of male and female agents may be attributable to 
age differences in the subjects (perhaps mothers and females become more im- 
portant with age) or to the fact that the situation in the laboratory analog studies 
with strange adults is not analogous to responses to real parents, even within the 
laboratory. Generalizations from laboratory analog studies of buzzer punishment 
and M&M rewards administered by strange adults to the effectiveness 
of parental discipline practices within the home seem to be even more precarious. 
Although even within the unfamiliar and contrived laboratory setting adminis- 
tration of buzzer punishment may convey information about the demands of the 
experimental task, and rationales may convey even more information, neither 
involves the sustained, intense affective component of parental discipline within 
the home. 


Racial Awareness and Attitudes 


Although the recent literature is deluged with articles on racial differences, only a 
few studies on racial awareness and attitudes have appeared. Masangkay et al 
(173), in a cross-sectional study of 10- and 15-year-old children, show that gener- 
alized attitudes involving positive and negative evaluations of ethnic groups occur 
at an earlier age than does the attribution of specific traits associated with the 
ethnic stereotype. Traits attributed to the child’s ingroup were learned before 
those of outgroup stereotypes. 

That this later differentiation of traits in outgroup members is at least partly 
attributable to problems in perceptual discrimination of members of other races is 
found in some of the studies from the ongoing research program of Katz (138, 
139). Katz (138) finds that young children experience more difficulty in learning to 
discriminate faces of another race. She proposes that the labeling of racial groups 
with distinctive racial names may lead to a lack of differentiation among 
individual members of the group and result in evaluative statements being more 
readily generalized to all group members. Through labeling and perceptual 
training techniques she was able to increase the perceptual differentiation of the 
other-group faces, and as perceptual differentiation improved, prejudice scores 
decreased (139). 


Sex Differences in Social Development 


Several major reviews of the literature on sex differences and gender identity have 
appeared recently. In her book, Male and Female (126), and in a provocative 
review in Human Development (125), Hutt emphasized the role of biological and 
morphological factors in the development of sex differences. In contrast, in their 
book, Man and Woman, Boy and Girl (193), Money & Ehrhardt reviewed biolog- 
ical factors in sexual differentiation and gender identity, but presented a more 
balanced interpretation of sex differences as being based on transactions between 
biological and experiential factors. 

Hutt (124, 126) places great emphasis on the relatively greater docility, and lack 
of aggressiveness, curiosity, exploration, and creativity in females. In addition, she 
reports that females are less active and aggressive, show more proximity-seeking 
behavior toward the mother, are more interested in toys with faces, and are more 
nurturant than males. She suggests that boys are more interested in objects or 
things and girls in people. 

Maccoby and her colleagues (128, 168), in reviewing the literature and in 
investigating sex differences in toy preferences, behavior at a barrier, activity, and 
seeking proximity to the mother, arrive at very different conclusions. They report 
that the majority of studies find no sex differences in proximity seeking or stable 
sex differences in activity level in young children, and that activity level is not a 
stable individual characteristic over time (167). Their own results tend to support 
these findings. In only one subexperiment were one-year-old boys found to be 
more active than girls. In addition, no sex differences in seeking to be near the 
mother were found (167). In contrast to the frequently cited findings of Goldberg 
& Lewis (94), when year-old children were confined behind a barrier, there were 
no sex differences in position behind the barrier or manipulation of the barrier, 
although under conditions of moderate stress girls cried more at the barrier. Sex 
differences in preferences for toys with faces were not found. In fact, boys showed 
a marked preference for a robot with a face (128). However, there is some evidence 
reported that girls may be more responsive to social stimuli than are boys. In a 
visual recognition task, 5- to 6-month-old infant girls could more readily discriminate 
among upright faces than could boys (66). In a study of communication 
in grade school children, girls were more likely than boys to adapt their communications 
in accord with the characteristics of the listeners (7). 

One of the social behaviors in which there is relatively consistent evidence of sex 
differences is in aggressive behavior. Brindley et al (35) found that two-thirds of all 
aggressive acts in nursery school were displayed by boys, and that the boys elicited 
more aggression than did girls. Girls, as is frequently reported, were more likely to 
use verbal aggression. These results are corroborated in a study by McGurk & 
Lewis (183) in which nursery school boys were found to be more aggressive and 
independently active and girls more help-seeking and obedient. 


Family Interaction 


Two trends in the study of parent-child relations can be seen. First, the missing 
parent, the father, has begun to appear in family studies. In the past, investigators 
seem to have focused more on the effects of father absence than on the role of the 
present father in child development. Second, investigators of family interaction 
have moved away from a reliance on parental reports of attitudes and child 
rearing practices to direct observation of parent-child interactions. 


STUDIES OF PARENT-INFANT INTERACTIONS These studies focused on two topics: 
investigations of cultural differences and studies of attachment. 

Although the precocity of African infants at birth is frequently reported, War- 
ren (268), in a careful review of the literature, concludes that because of meth- 
odological deficits in such studies no such conclusion can validly be drawn. He is 
particularly critical of the measurement problems and confusion in the reporting 
of data in the frequently cited studies by Geber and Dean (82-88). He proposes 
that the most fruitful approach to this topic is to study not racial differences but 
differences in the social level or specific aspects of the cultural milieu and their 
effects on the development of the child. 

In papers reported in a symposium on Cross-Cultural Studies of Mother-Infant
Interaction, many of the investigators appear to be utilizing the strategy recom- 
mended by Warren. Brazelton (34), in his study of Zinacanteco Indian infants of 
Southeastern Mexico, and Rebelsky (223), in her study of Dutch infants, both 
propose that although the child rearing practices utilized within these groups differ 
from those in the United States, they are appropriate for training the kind of 
people considered desirable in the culture. Although the restrictive, nurturant
pattern of Zinacanteco mothers in caring for their infants is associated with
imitative, nonexploratory behavior, this is adaptive in a culture in which conformity
and acceptance of highly structured roles necessary for survival are valued, in
contrast to the emphasis on individuality and independence in children in the 
United States. 

Certain cultural variations in parental stimulation of the infant may lead to 
differential development of abilities in children (100). Goldberg (93) proposes that 
accelerated development of Zambian infants on spatial ability in contrast to object 
ability, compared with American norms, may be attributable to Zambian infants
getting varied stimulation and constant contact by being carried in a sling by the
mother, accompanied by little access to objects or toys for play. Lewis & Wilson 
(160) report striking social class differences in American mothers’ interactions with 
their 12-week-old infants; although the amount of overall stimulation and the 
amount of contingent stimulation did not differ, the type of stimulation did vary. 
Middle class mothers tended to respond to infant vocalizations by vocalizing, and 
to fretting by holding and touching; the reverse was true for lower class mothers. 
Greenfield (100) argues that it is the appropriateness of the mother’s contingent 
response that is important in the development of the child, and that whereas
vocalizing to a child’s vocal responses should lead to increased imitative vocali- 
zation in the child, vocalization in contrast to physical contact is an inappropriate 
response to fretting. 

The number of studies of attachment has diminished lately, but in a study by
Beckwith (17), evidence continues to accumulate that infants are most attached
and socially responsive to mothers who are attentive and respond rapidly to the
babies’ signals, and who are permissive in allowing the infant to express his
impulses and explore. These findings are in agreement with those in previous 
studies of attachment which emphasize qualitative differences in maternal re- 
sponsiveness as a determinant of infant social behavior (5, 235). Reinforcement 
theorists would predict that such rapid maternal responding to an infant’s cries 
might increase attachment but should also reinforce crying. However, Bell & 
Ainsworth (20) found that consistency and promptness of maternal response were
associated not only with an increase in attachment but also a decline in frequency 
and duration of infant crying over the course of the first year of life. 

Although Bell & Ainsworth (20) report that close physical contact is the most 
effective means of terminating crying, other investigators suggest that it may be the 
vestibular-proprioceptive stimulation involved in picking up and holding the 
infant rather than contact per se which is the most important factor in soothing 
(148, 214, 259). 


PARENT-CHILD INTERACTION WITH OLDER CHILDREN The most frequently 
investigated behaviors of children in current family interaction studies involving 
children beyond infancy are intellectual development, self-control, and moral
development.

Interest in the relation of social class and ethnicity and family interaction styles 
has generated several studies. Two which used both mothers and fathers studied 
the relation of socioeconomic factors and parent-son interaction. Busse & Busse 
(42) found more highly educated black fathers and mothers talked more to their 
11-year-old sons than did less educated parents. In addition, more educated
mothers showed greater encouragement of autonomy than less educated mothers, 
and more educated fathers were more overtly affectionate than less educated 
fathers.

Radin (221) studied the correlates of paternal behavior and 5-year-old sons’ IQ 
in lower and middle class white families. She found that in lower class families 
paternal restrictiveness and high expectations of grades were negatively related to 
their IQ scores, whereas in middle class families paternal nurturance, particularly
in the form of asking information of the child, was associated with higher IQs in 
their sons. 

In contrast to these two studies which emphasized social class differences in 
correlates and patterns of parent-child interaction, Steward & Steward (256) 
found that maternal differences in styles of teaching their 3-year-old sons were 
more closely related to ethnicity than to social class. 

Lack of self-control and deviant behavior in children have been associated with
the inability of fathers to assert their authority (6), and lack of control (101), low 
communication (154), and low achievement standards (74) in mothers. Mothers 
seem to play a more important role in their sons’ acquisition of self-control than do 
fathers. This has been attributed to the greater availability and involvement of the 
mother in discipline during the day when the father is at work, and to the greater 
communicativeness of the mother (155, 156). In a laboratory resistance to temp- 
tation task with adolescent sons, fathers were found to be more likely to intervene 
or punish with no opportunity for explanation from the son following 
transgressions. Mothers were more likely to encourage the child to explain his acts
and to discuss the implications of the transgression and the reasons for their
responses before disciplining. These are the kinds of behaviors that previously have been
found to be associated with greater self-control in children (9, 16, 119). 

The greater communication between mother and child is also found in a lab- 
oratory study of teaching styles of mothers and fathers with their daughters (207). 
In spite of the great theoretical impact that Bell’s paper (18) on the role of the child 
in shaping the behavior of parents has had, only a handful of studies have directly 
investigated this issue. The Osofsky study is an excellent example of the mutual 
shaping of parent and child responses. Parents interact more and are more con- 
trolling in teaching their preschool daughters how to do puzzles when the 
daughters are dependent than when they are independent. However, fathers are 
more likely to help the child physically or become more detached, whereas the 
mother more frequently encourages independence and explains, questions, and 
comments to her daughter. In turn, the girls are more likely to demonstrate 
task-oriented behaviors with their fathers and demands for support and attention 
with their mothers. 


Family Structure 


The two aspects of family structure which continue to receive the most attention 
are father absence and birth order. 


FATHERLESS FAMILIES Studies of the effects of father absence on child develop- 
ment have frequently involved simple-minded comparisons of the differences 
between children reared in homes with or without fathers. Usually little attention 
is paid to such factors as the time and cause of separation from the father, the age 
of the child at the time of testing, and stresses and variations in family functioning 
that might mediate any differences found in children from intact or fatherless 
families. Two recent studies of father absence consider some of these problems 
(116, 233). Both studies found diverse effects of father absence on children
according to whether separation had been caused by the father’s death or by divorce
or desertion. Santrock (233) found that the adverse effects on cognitive development
and achievement of father absence due to divorce or desertion are greatest if it
has occurred in the first 2 years of life for boys and girls, and death of the father is
most detrimental if it occurs at ages 6 to 9 for boys. Boys with absent fathers
showed lower IQ and achievement than boys from intact homes or girls with 
absent fathers. 

In one of the few studies in which the effects of father absence on the social and 
emotional development of adolescent girls were systematically investigated, few 
deviations in traditional measures of sex role typing and in relations with female 
peers and adults were found (116). This is congruent with the findings on younger 
girls from homes with absent fathers. Although relations with other females were 
normal, disruptions in interactions with males by girls with absent fathers occurred.
Girls who had lost their fathers through either divorce or death
reported being tense and anxious around males, but they showed different methods
of coping with their anxiety. In contrast to girls from intact families, daughters
of divorcees manifested more proximity and attention seeking from males, earlier 
heterosexual behavior, and various forms of nonverbal communication with 
males associated with receptiveness. However, the daughters of widows showed 
inhibition, rigidity, avoidance, and restraint around males. It was proposed that 
experience in interacting with a warm, appreciative father helps daughters acquire
appropriate interpersonal skills and a sense of confidence and competence in 
subsequent relationships with males. The diverse responses of daughters of di- 
vorcees and widows were associated with the aggrandized image of the dead father 
held by widows and their daughters, the negative attitudes toward the father of 
divorcees and their daughters, and the variations in the characteristics of the 
divorcees and widows and their home situation. 


BIRTH ORDER The investigation of sibling influences and the effects of birth order
has frequently yielded contradictory results. In two apparently well-designed
studies on birth order, family size, and intelligence, one conducted in the United 
States (181) and one in the Netherlands (21), completely antithetical results were 
obtained. In the former study, no associations between IQ, family size, and birth 
order were found, whereas in the latter study, performance on the Raven Pro- 
gressive Matrices decreased as family size and birth order increased. Although the 
age and nationality of the subjects and the tests used in the two studies differ,
in both studies the confounding effects of social class and birth order with family 
size were carefully controlled. It is difficult to explain such gross discrepancies in 
results. 

In a third study of Catholic families in Ireland (141), where it is assumed that 
family size is less likely to be confounded with social status, a measure of modified 
ordinal position was obtained by dividing each child’s ordinal position by the 
number of children in the family. This reduced but did not eliminate the association
between family size and ordinal position. A multiple regression coefficient
indicated that social status, family size, and modified ordinal position were related 
to verbal reasoning ability. However, as is true with most studies in this area in 
which statistically significant results are obtained, the effects of ordinal position 
account for only a small portion of the variance in intellectual performance, and 
the practical significance of these findings must be questioned. 
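
The modified ordinal position described above is simple arithmetic, and a minimal sketch may make it concrete. The function name and the example families below are illustrative assumptions and are not taken from the study (141).

```python
from fractions import Fraction


def modified_ordinal_position(ordinal_position: int, family_size: int) -> Fraction:
    """Ordinal position divided by the number of children in the family."""
    if not 1 <= ordinal_position <= family_size:
        raise ValueError("ordinal position must lie between 1 and family size")
    return Fraction(ordinal_position, family_size)


# Hypothetical children: lastborns share one value regardless of family size,
# whereas firstborns still differ according to the size of the family.
print(modified_ordinal_position(2, 2))  # 1
print(modified_ordinal_position(8, 8))  # 1
print(modified_ordinal_position(1, 2))  # 1/2
print(modified_ordinal_position(1, 8))  # 1/8
```

In this sketch the firstborn values still vary with family size, which is consistent with the report that the adjustment reduced but did not eliminate the association between the two variables.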

A fairly consistent finding that sex role typing in younger boys is influenced by 
the sex of the older sibling is supported by two studies. Bigner (24) found that 
preschool children showed sex role preferences which accorded with the sex of 
their older sibling. In a second study of college students, males with an older sister 
reported themselves to have homosexual inclinations, sexual repugnance, and 
avoidance of sexual relationships (137). 

It has been reported that firstborn children employ more high-power persuasive 
techniques on their younger siblings, while laterborns employ more low-power 
persuasive techniques on their elder siblings. In a well-conceptualized study, Bragg,
Ostrowski & Finley (29) show that in previous studies, birth order and age of the 
target are confounded. When these effects are varied, the major differences in 
persuasive techniques are found to be attributable to the age of the target and not 
the birth order of the subject. 

Finally, in a critical review of birth order effects, Schooler (237) concludes that if 
one allows for the fluctuations over time in population statistics, there is almost no 
reliable evidence for birth order effects among males. Although her argument is 
convincing, and biases in long-term population trends can explain many of the 
birth order findings, the studies used in her review are highly selected. In her 
section on differences in parental approaches to children of different birth ranks, 
she cites only three studies of parental reports of their treatment of their children 
and covers none of the recent laboratory or observational studies of parents 
interacting with their neonates or older children in which marked birth order
effects have been obtained. In addition, the extensive work of Sutton-Smith and 
Rosenberg is never mentioned. 


FINAL COMMENT 


Perhaps the most marked feature in the field of developmental psychology is the 
lack of satisfactory theories of child development. Many investigators seem to 
have coped with this problem by doing completely atheoretical research; others 
are busy patching, mending, and modifying old theories; and some are building 
mini-theories that deal with very restricted areas of behavior. Although some 
modest theoretical convergence between areas is occurring, notably in the increased
awareness of the role of cognitive factors in a variety of behaviors, one
comes away from a review of the literature feeling that developmental psycholo- 
gists working in different areas don’t talk to each other. Little systematic theoretical 
research is being done; more concern is focused on methodological than theoretical 
refinements; few attempts are being made to organize and integrate the morass of 
findings on child psychology that we now have. The literature is replete with
highly redundant, often trivial research or “single shot” studies that add little to 
our understanding of developmental processes. It seems to be an inefficient ap- 
proach to the study of children’s behavior. Although the familiar cry “to get it all 
together” seems to be asking an impossibility, the time is ripe to get some of it 
together. The current need is for a careful analysis, synthesis, and evaluation of the 
information we now have and an attempt to evolve theories which will result in 
more systematic and fruitful strategies of research. 
He appointed the moon for seasons: the sun knoweth his going down. Thou makest
darkness, and it is night: wherein all the beasts of the forest do creep forth. The young 
lions roar after their prey, and seek their meat from God. The sun ariseth, they gather 
themselves together, and lay them down in their dens. Man goeth forth unto his work and 
to his labor until the evening. 


Psalm 104: 19-23 


INTRODUCTION 


The rhythmic or cyclic nature of animal behavior has been appreciated by man 
since early in recorded history (158). Formerly, attempts to understand daily 
cycles of activity, annual reproductive rhythms, and the seasonal occurrence of 
migration and hibernation emphasized changes in the external environment as the 
causative or driving forces underlying periodicity. This view predominated well 
into the present century and assigned a more or less passive role to the organism 
displaying the rhythm; only within the last quarter century have biological clocks 
within living organisms been widely recognized as major determinants of behav- 
ioral and physiological rhythms. The potential adaptive significance of these 
rhythms and their probable mode of evolution have been considered in several
reviews (69, 124, 150).

Psychologists have long been interested in the timing of behavior and have 
struggled with behavioral periodicity in their theories of motivation. At issue here 
is why a given animal is active at one time and quiet at another; why it eats, 
copulates, sleeps, or fights at one time of day and not at another. In general, animal 
behaviorists within the American psychological tradition have not given adequate 
weight to the role of biological clocks in their answers to these questions. Thus, one 
of our main goals in this article is to emphasize the importance of biological clocks 
for the general understanding of behavior and more particularly in the design and 
interpretation of studies concerned with behavioral phenomena. We have dealt 
primarily with vertebrate and especially mammalian behavior, but have not at- 
tempted to cite all the relevant recent literature. Limitations of space have often 
forced us to cite a single review article rather than several original research reports. 
We have emphasized daily and seasonal cycles in behavior and physiology and 
have not considered rhythmic components of human performance, nor the in- 
volvement of biological clocks in periodic illness. 

A number of books on biological rhythms are of special interest to students of 
animal behavior. Broadest in scope are the volumes edited by Aschoff (9) and by 
Menaker (125), and the reviews by Cloudsley-Thompson (42), Luce (113), and 
Bünning (37). The most recently published volume of The Neurosciences contains 
several articles of interest and provides an excellent introduction to biological 
rhythm research (127, 152, 189). Several more specialized monographs describe 
human circadian rhythms (43, 46) or individual research programs (170). Articles 
on biological rhythms are found in scores of scientific periodicals; two recently 
founded journals, Journal of Interdisciplinary Cycle Research and Chronobiologia,
are devoted to reports on biological periodicity. 


Definition of Terms Used 


The novice is frequently intimidated by the unfamiliar mathematical symbols 
used to describe parameters of rhythmic behavior. The analogy between biologi- 
cal clocks and physical oscillators has prompted the wholesale adoption of a 
complex terminology and methods of data analysis (148); these are useful but not 
essential to an understanding of basic principles. The main concepts and symbols 
encountered are listed below; this brief description is based on a report by Aschoff
et al (16).

LD cycle: Alternating periods of light and darkness (or brighter and
dimmer phases) whose durations are given as in LD 14:10. This designates a cycle
of 14 hr of light, alternating with 10 hr of no measurable light (darkness). The
intensity of illumination during the respective phases is often indicated.

LL, DD: Respectively, continuous illumination and continuous darkness.

Zeitgeber: A periodic environmental factor that entrains a biological rhythm; the
LD cycle is the paramount example of a zeitgeber.

Endogenous rhythm: One requiring no periodic environmental input to maintain 
its periodicity; a self-sustaining oscillation. 
Entrainment: The coupling of an endogenous rhythm to a zeitgeber such that
both display the same period (synchronization). 

Period: The duration of a single cycle; i.e. the time between successive recurrences 
of a specified phase of a cycle. Designated as τ for the biological rhythm and T for
the zeitgeber.

Frequency: The reciprocal of the period; i.e. if τ = 0.5 hr then the frequency is 2
cycles per hr. 

Phase: The instantaneous state of an oscillation. Any point within a cycle. 

Phase angle: The value on the abscissa corresponding to a particular phase of a 
cycle; this is given as some fraction of the total cycle (hours, degrees, radians) and 
is represented by φ for a biological rhythm and Φ for a zeitgeber.

Phase angle difference: The difference between corresponding phase angles of two 
coupled oscillators. 

Phase shift: A displacement of an oscillation along the time axis, designated Δφ
and ΔΦ for biological rhythms and zeitgebers, respectively. A shortening of the
period is a phase advance while a lengthening is a phase delay. 

Free-running rhythm: A self-sustaining rhythm not entrained to any zeitgeber and
therefore displaying its spontaneous period, usually under constant conditions.

Circadian rhythm: An endogenous rhythm with a period of approximately 24 hr.
The range of deviation permitted for classification as a circadian rhythm is
undefined, but limits between 21 and 27 hr are usually suggested.
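
Several of the definitions above reduce to simple arithmetic, and the following minimal sketch restates them in code. All function names are hypothetical; the choice of degrees for phase angles, the sign convention for the phase angle difference, and the treatment of the 21 to 27 hr limits as inclusive are assumptions of the sketch rather than parts of the definitions.

```python
from typing import Tuple


def parse_ld_cycle(notation: str) -> Tuple[float, float]:
    """Split a notation such as 'LD 14:10' into (light hr, dark hr)."""
    label, hours = notation.split()
    if label != "LD":
        raise ValueError("expected a notation of the form 'LD 14:10'")
    light, dark = hours.split(":")
    return float(light), float(dark)


def frequency(period_hr: float) -> float:
    """Frequency in cycles per hr: the reciprocal of the period."""
    if period_hr <= 0:
        raise ValueError("period must be positive")
    return 1.0 / period_hr


def phase_angle_deg(hours_into_cycle: float, period_hr: float) -> float:
    """Phase angle, in degrees, of a point a given number of hr into a cycle."""
    return (hours_into_cycle % period_hr) / period_hr * 360.0


def phase_angle_difference_deg(rhythm_deg: float, zeitgeber_deg: float) -> float:
    """Difference between two phase angles, folded into (-180, 180].

    Subtracting the zeitgeber angle from the rhythm angle is an assumed
    convention for this sketch.
    """
    diff = (rhythm_deg - zeitgeber_deg) % 360.0
    return diff - 360.0 if diff > 180.0 else diff


def is_circadian(period_hr: float, lower: float = 21.0, upper: float = 27.0) -> bool:
    """Apply the usually suggested 21 to 27 hr limits (inclusive by assumption)."""
    return lower <= period_hr <= upper


light_hr, dark_hr = parse_ld_cycle("LD 14:10")
print(light_hr + dark_hr)           # zeitgeber period T: 24.0
print(frequency(0.5))               # 2.0 cycles per hr
print(phase_angle_deg(6.0, 24.0))   # 90.0
print(is_circadian(25.0))           # True
```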


CIRCADIAN RHYTHMS
General Considerations 


The analogy between circadian clocks and physical oscillators has been fruitful 
both in generating empirical research and in providing a theoretical framework 
for biochronometry (9, 152, 200). The basic principle of this analogy is that 
rhythmic behavioral and physiological functions can be analyzed in terms of the 
known properties of linear (pendulum-like) and nonlinear (capacitor-like) physi- 
cal oscillators (201). There is now abundant evidence that multicellular organisms 
contain many oscillators, each of which may have different characteristics and
periods. A human subject under constant environmental conditions may simul- 
taneously display free-running periodicities of 33 hr duration for the sleep-wake- 
fulness cycle and of 25 hr duration for the rhythm in body temperature (12). 
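
To give a sense of scale for the example just cited, the sketch below computes how long two rhythms with periods of 33 and 25 hr would take to drift through one full cycle relative to each other. It assumes, purely for illustration, that both periods remain constant and that the rhythms do not influence one another; the function name is hypothetical.

```python
from fractions import Fraction


def relative_drift_interval_hr(period_a_hr: int, period_b_hr: int) -> Fraction:
    """Hours needed for two rhythms to regain a given mutual phase relation.

    The relative phase changes by (1/period_b - 1/period_a) cycles per hr,
    so one full cycle of drift takes period_a * period_b / |period_a - period_b| hr.
    """
    if period_a_hr == period_b_hr:
        raise ValueError("rhythms with equal periods do not drift apart")
    return Fraction(period_a_hr * period_b_hr, abs(period_a_hr - period_b_hr))


interval = relative_drift_interval_hr(33, 25)
print(float(interval))        # 103.125 hr
print(float(interval) / 24)   # a little over 4 days
```

Under these assumptions the temperature rhythm completes exactly one more cycle than the sleep-wakefulness rhythm during that interval.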

A single rhythmic function (e.g. locomotor activity) may reflect the operation of 
two or more independent oscillators that normally are strongly coupled but may in 
fact show different natural frequencies and different sensitivities to light (152). 
One oscillator can only be phase-advanced by light and the other only phase- 
delayed. Together these oscillators can account for effects that have hitherto been 
interpreted in terms of a single oscillator with differentially sensitive phases. The 
dual-oscillator model is supported by the “splitting” of an apparently uniform 
rhythm into two independent components. Under special constant conditions, viz. 
very dim illumination for tree shrews (Tupaia belangeri) (90) or very bright light
for hamsters (Mesocricetus auratus) (152) or after hypothalamic lesions in the latter
species (see 206a), these oscillators may become dissociated and free-run relative
to each other. Starlings (Sturnus vulgaris) may show similar splitting of their ac- 
tivity rhythms when injected with testosterone (75a). Other evidence supporting 
this model includes the effects of contracting “skeleton” photoperiods (152) and 
the ubiquity of bimodal activity patterns throughout the animal kingdom (11). 

The gross behavioral measures commonly used and the strong tendency for 
oscillators to remain coupled have prevented the more frequent detection of 
multiple oscillators for each measurable rhythmic function. The integrated cyclic 
behavior of multicellular organisms can be conceived as resulting from a complex 
interaction among oscillators possessing circadian and noncircadian frequencies; 
the neurophysiological basis for the interaction within and between oscillatory 
constituents remains unexplored (152). 

Much of what is known about circadian clocks has been learned from observing 
the effects of light on rhythmic behaviors. Some of these relations have been 
summarized by Aschoff (9). His Circadian Rule states that as the intensity of 
constant illumination increases, the frequency of the free-running rhythm in- 
creases for day-active species and decreases for night-active animals. These effects 
predict the behavior of oscillators under entrained conditions as well; thus any 
manipulation that increases the frequency of the circadian clock relative to that of 
the zeitgeber will cause the animal’s oscillator to phase lead the zeitgeber by 
increasing amounts [e.g. this will occur in L-active animals when light intensity is 
increased (10)]. These principles have been confirmed in many species and dis- 
confirmed in several others (88). 
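
The Circadian Rule as stated above can be written as a small lookup, sketched below. The function names and string labels are hypothetical, and the outcome for a decrease in intensity is an assumption obtained by mirroring the stated rule. Because the rule has been disconfirmed in several species, the sketch restates the rule and should not be read as a prediction for any particular animal.

```python
def free_running_frequency_change(active_phase: str, intensity_change: str) -> str:
    """Direction of change in free-running frequency under constant light.

    active_phase is "day" or "night"; intensity_change is "increase" or
    "decrease".
    """
    if active_phase not in ("day", "night"):
        raise ValueError("active_phase must be 'day' or 'night'")
    if intensity_change not in ("increase", "decrease"):
        raise ValueError("intensity_change must be 'increase' or 'decrease'")
    # Stated rule: brighter constant light raises the frequency of day-active
    # species and lowers that of night-active species. The mirror-image result
    # for dimmer light is an assumption of this sketch.
    follows_intensity = active_phase == "day"
    if intensity_change == "increase":
        return "increase" if follows_intensity else "decrease"
    return "decrease" if follows_intensity else "increase"


def free_running_period_change(active_phase: str, intensity_change: str) -> str:
    """Period is the reciprocal of frequency, so it moves in the opposite direction."""
    change = free_running_frequency_change(active_phase, intensity_change)
    return "decrease" if change == "increase" else "increase"


print(free_running_frequency_change("day", "increase"))    # increase
print(free_running_frequency_change("night", "increase"))  # decrease
print(free_running_period_change("night", "increase"))     # increase
```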

The Circadian Rule is useful in interpreting the effects of various experimental 
manipulations. For example, a treatment which “decreases nocturnality” in a LD 
cycle may in fact decrease the τ of the underlying activity rhythm and thereby
cause a phase advance relative to the LD cycle. Alternatively, the treatment could 
increase or decrease sensitivity to light in the entrainment pathway, and thereby 
alter the phase angle of activity onset without effecting a reduction in rhythm 
amplitude per se. 

Kavanau (101) has suggested that species are physiologically adapted to be 
active at a particular “optimum level” of illumination; intensities above or below 
this level will decrease activity, so the linearity implied by the Circadian Rule may
apply only in the middle range of illumination intensity (101). Both laboratory and
field observations support Kavanau’s position in demonstrating that many species
are most active at moderate illumination intensities (twilight) rather than during 
the brightest or darkest part of the day-night cycle (42). Kavanau’s somewhat 
untraditional methods of data analysis, his measurement of activity over rather 
limited time spans, and the lack of uniformity between his analyses and those used 
in establishing the Circadian Rule weaken his criticisms of the latter (101). 
Nevertheless, his testing of animals over a broad range of light intensities, use of 
controlled twilight transitions, and investigation of a diverse array of species are all 
features that recommend his work (102); the prediction of an optimum species-specific
illumination intensity, frequently in an intermediate range, is worthy of
rigorous testing. 


Functional Significance 


Biological clocks may have evolved to time behavior in relation to the external
environment. For example, the initiation of eclosion activity several hours prior to 
dawn is such that the emergence of adult Drosophila from pupae coincides with 
the optimal humidity level (150). Certain circadian timers can also function as true 
clocks, that is, they can be consulted at any time. The sun-compass orientation of 
birds (89) and bees (197) allows animals to modify their flight direction relative to 
the sun’s position, exactly as if they had access to time of day information; thus 
they respond to where the sun “should” be at a given time. Such information may 
be crucial for navigation during migration. Circadian clocks are used by some 
species to measure daylength and thus are useful in timing seasonal events (see 
Seasonal Rhythms). 

Timing is also crucial in relation to other organisms; predators hunt when prey 
are most active, and related species that feed on similar diets can avoid certain 
types of competition by foraging at different times of day (cf 145; 194, p. 343). 
Ostriches (Struthio sp.) provide an example of the function of daily rhythms in 
relation to camouflage. In their usual desert habitat the black-plumed male sits on 
the nest at night while the pale brown female incubates the eggs by day (119a). 
Ringdoves (Streptopelia risoria) show a similar temporal division of labor (Whit- 
man, cited in 107a). 

The internal milieu is a temporally integrated system whose continuity depends 
on biological clocks and their coupling as well as on feedback relations. In Drosophila,
longevity is considerably decreased on 21 or 27 hr LD cycles. Presumably
cycles that differ from 24 hr cause dysfunctional internal phase relations among 
various rhythms (156). The synchronization of enzymatic activity in various or- 
gans and the synergistic actions of hormones that depend on precise temporal 
patterns of secretion are but a few examples of the need for synchronization of 
internal processes.

Menaker (124) notes that very little is known concerning the adaptive signifi- 
cance of particular rhythms in particular organisms, but concludes that “rhythms 
will eventually be shown to confer many different kinds of selective advantage on 
organisms which possess them.” 


Physiological Basis for Entrainment by Light 


RECEPTORS The existence and functional significance of extraocular photore- 
ceptors in nonmammalian vertebrates is well established (127), but none of the 
adult mammals studied to date show signs of entrainment by light after removal of 
the eyes. Since only nocturnal species have been studied systematically, it remains 
possible that direct penetration of light through the skull (70, 208) and extraocular 
photoreceptors could entrain rhythmic functions in diurnal mammals (cf 126). On 
the basis of current information it is most likely that ocular receptors are the
principal, if not the exclusive, mediators of light entrainment in mammals. 

The specific retinal elements involved in entrainment are unknown; whether 
unique classes of receptors underlie visual perception and entrainment by light 
also remains to be established; hitherto unsuspected receptors are apparently 
involved in both processes. Excellent light-dark discrimination has been demonstrated
in albino rats (Rattus norvegicus) after complete destruction of the retinal
receptor layer by exposure to LL (5), although in one study 50% of the LL rats did 
not consistently perform above chance levels and the remaining animals required 
more trials to reach criterion than controls (22). Degeneration of photoreceptors 
induced by exposure to constant illumination of moderate intensity is usually 
complete after one month (5), and is not reversed when rats are returned to a LD 
cycle (55). The optic tract and lateral geniculate nucleus of most of these animals
show no response to photic stimulation (143). 

Nevertheless, these animals can entrain their corticosterone rhythms to the LD 
cycle (55) and show light-dependent reproductive and pineal responses (167). It is 
not known whether these functions are performed by a few residual rods or by an 
entirely different kind of receptor (e.g. bipolar or ganglion cells). The harderian 
glands, located within the orbital cavity directly behind the eye, do not appear to 
be the photoreceptors involved in these processes (166). Reiter has speculated that 
some cellular element within the eye other than the classical photoreceptors 
transmits information to the hypothalamus and that a humoral agent secreted by 
the eye also may be involved (166, 167).

According to Caley et al (39), mice (Mus musculus) with retinas devoid of clas- 
sical photoreceptors manifest circadian rhythms in feeding that differ from those 
of control animals; the latter show the usual nocturnal feeding cycles whereas 
the rodless animals reportedly have a “freely running rhythm” relative to day 
and night (39). This observation, if confirmed, would imply that entrainment of 
feeding by the illumination cycle is mediated by the rods. Unfortunately, the only 
published records of feeding rhythms in CBA/Ki mice (110) do not support the
conclusions of Caley et al: the feeding patterns of these animals (110, Fig. 6) 
appear to be well within the normal range for nocturnality. The existence of 
free-running rhythms would be difficult to prove given the frequency with which 
feeding was recorded. Mouse strains with various ocular abnormalities (180) 
provide an excellent and to date unexploited opportunity for the analysis of 
receptor mechanisms underlying entrainment, photoperiodicity, and visual 
perception. 


PHOTOPIGMENTS The photopigments involved in entrainment are unknown; 
systematic studies of action spectra and thresholds for entrainment are almost 
completely lacking (127). Green light is most effective as a zeitgeber for the rat’s 
circadian body temperature rhythm, with red and ultraviolet wavelengths rela- 
tively ineffective (119). Short wavelengths are most effective for phase-shifting 
the daily rhythm of torpor in pocket mice (Perognathus penicillatus) (71). The 
correspondence between the rhodopsin sensitivity curve and the efficacy of various
wavelengths for entrainment (119) suggests that rhodopsin may be involved
in the latter process. It would be useful to test this hypothesis on rats deprived of 
photoreceptors by prior exposure to constant light. 

Wavelengths corresponding to blue and green light most effectively influence
locomotor activity of mice (183) and also constitute the components of white light 
mainly responsible for the degenerative changes induced by constant illumination 
(4). The possibility that a given wavelength range is maximally effective for 
entraining all rhythmic functions in a given organism is intriguing and worthy of 
experimental verification. This seems to be the case in the moth Pectinophora 
gossypiella in which effective spectral components for phasing several behaviors 
are similar but different from those involved in photoperiodic time measurement 
(155). The wavelengths most effective in mammals with good photopic vision are 
completely unknown. 

Photopigments involved in entrainment may be distinct from those that mediate 
other light-dependent functions (e.g. the visual perception of objects). Thus
carotenoid-depleted Drosophila have retinules grossly deficient in rhodopsin and their
visual receptors show a 3 log-unit reduction in light sensitivity; nevertheless, the 
sensitivity of their circadian emergence rhythm to the phase-shifting effect of light 
flashes appears to be normal (205). 


CENTRAL VISUAL PROJECTIONS Light impinging on the mammalian eye almost 
certainly acts via the established visual projections to entrain the central oscillators 
underlying rhythmicity. A neurohumoral mechanism for entrainment seems at 
best a remote possibility. 

The central visual projections have been most extensively studied in the albino 
rat; information on this species is summarized by Moore (131). He notes that all 
visual input from the eye enters the optic chiasm via the optic nerve. Of the three 
pathways leaving the chiasm the largest is the primary optic tract (POT) which 
consists of uncrossed and decussating retinal components that terminate in the 
superior colliculus, pretectum, and lateral geniculate. The superior accessory optic 
tract (SAOT) runs within the POT and terminates in the midbrain tegmentum. A 
third visual pathway that leaves the chiasm, the inferior accessory optic tract 
(IAOT), consists entirely of crossed retinal fibers; after leaving the chiasm it runs 
in the ventral part of the brain along with fibers of the medial forebrain bundle 
and eventually terminates in the medial terminal nucleus of the midbrain teg- 
mentum (81). The IAOT has been traced in a number of mammalian species but 
apparently is not present in the golden hamster (56).

A fourth visual pathway, the retinohypothalamic tract (RHT), provides a direct
connection between the retina and the suprachiasmatic nuclei of the hypothala- 
mus. This projection is bilateral even in rodent strains in which all other com- 
ponents of the retinal projection are crossed (83). Input to the contralateral 
suprachiasmatic nucleus (SCN) is about twice as heavy as to the ipsilateral side; 
most retinal fibers terminate in the ventral part of the SCN where they form 
asymmetric synapses either on small dendritic branches or on dendritic spines 
(83). The distribution, receptive fields, and discharge properties of the retinal 
ganglion cells that project to the SCN are unknown (83). Although a
retinohypothalamic pathway had previously been described (137), the definitive 
proof of its existence in mammals required the use of autoradiographic tracing 
techniques and was accomplished only within the past 4 years (83, 130, 131). The 
RHT remains difficult to detect with conventional tracing procedures such as silver 
impregnation (111); a thoughtful discussion of problems involved in this analysis
is contained in (83). To date the RHT has been described in mammals represent- 
ing several orders, including rodents, carnivores, marsupials, and primates (130). 
It also exists in birds (123) and other nonmammalian vertebrates (137). 

There is no firm evidence that pathways other than the retinohypothalamic tract 
are essential for entrainment of mammalian circadian rhythms. Lesions that bi- 
laterally interrupt the primary and/or accessory optic tracts do not eliminate 
steady state nocturnal drinking rhythms of albino rats maintained on a LD 12:12 
cycle (187) or locomotor rhythms of pigmented rats (3). It was also reported that 
POT transection did not affect the phase-shifting of the drinking rhythm after a 12 
hr shift in the illumination cycle (187). Subsequent observations show that lesions 
aimed at the lateral geniculate nuclei to interrupt the primary optic tracts retard 
the rate of phase-shifting of the hamster wheel running rhythm; re-entrainment 
requires 3 times as long as for controls (206a). Final steady state entrainment is 
comparable to that of unlesioned hamsters. Failure to observe similar changes 
with respect to the rat’s drinking rhythm may be due to procedural differences 
between the studies. 

Interruption of the primary and accessory optic tracts also does not interfere 
with steady state adrenal corticosterone rhythms or with the pineal rhythm of 
N-acetyltransferase (131). Entrainment of these rhythms as well as the circadian 
locomotor rhythm of rats (186) and the locomotor and drinking rhythms of 
hamsters (see 206a) is disrupted by lesions that destroy the retinohypothalamic 
projection. Knife cuts posterior to the SCN also disrupt entrainment of the adrenal 
corticosterone and the pineal N-acetyltransferase rhythms (131, 132), perhaps by
interrupting efferents from the SCN. Although this pathway has not been ade- 
quately described, Krieg (105) suggests that SCN axons course dorsally and then 
caudally in the periventricular system.

The retinohypothalamic projection is probably necessary and sufficient for 
photic entrainment of mammalian circadian rhythms, and the primary optic tracts 
may influence these processes by virtue of their indirect connections with the 
suprachiasmatic nuclei (190). 


Zeitgebers Other than Light 


Light is the acknowledged dominant zeitgeber for mammalian rhythms. In our 
own species, however, social factors may synchronize free-running rhythms (13) 
and experimental instructions can override illumination effects (142). In social 
isolation some human rhythms may free-run in the presence of “adequate” LD 
cycles to which other animals would entrain (13). Social stimuli may be more 
important synchronizers of human rhythms because we control our environment 
to a greater extent than do other mammals; this could have reduced the biological 
significance of the illumination cycle. Caution should be exercised in drawing such
a conclusion because animal subjects are rarely tested in social situations and the 
effectiveness of social cues in entraining their rhythms is largely unknown. The 
likely efficacy of social stimuli is indicated by the observation that nondominant 
rats in a seminatural setting modify their eating and activity rhythms to avoid 
conflict with dominant animals (40). The changes observed in animal activity 
rhythms after human disruption of their environments (101) also suggest an 
important role for nonlight factors in modifying mammalian rhythms. 

A sparrow housed in constant dim illumination has been entrained to the sight 
and/or sound of a conspecific; recorded bird song can also entrain the activity 
rhythms of some sparrows, but this zeitgeber is far weaker than an illumination 
cycle (128). The cycle of maternal behavior has been proposed as a social stimulus 
that may contribute to the development of rhythmicity in the young (1). However, 
rat pups reared with constant access to maternal care, supplied by mothers on 
opposite LD cycles, show the normal development of rhythmicity (69). 

Perhaps the most extensively documented social influences are connected with 
reproductive rhythms. In some species synchronization of estrus can be effected by 
the presence of males; female mice exposed to the odor of males tend to come into 
heat and to ovulate on the same day of the estrous cycle (34). Pheromonal 
mechanisms may also be involved in the reported synchronization of menstrua- 
tion in women who live together in college dormitories (118). There has been 
virtually no experimental exploration of the influence of conspecifics on daily 
rhythms of mammalian behavior. 

Other weak zeitgebers include electrical fields (202) and cyclic pressure changes 
(80). Some stimuli may affect specific rhythms without necessarily acting as clas- 
sical zeitgebers; e.g. limited daily temporal access to food or water rephases 
rhythms of locomotor activity (29), adrenal corticosterone secretion (98), sleep 
(134), and metabolic cycles (28). Handling of animals at the same time each day 
may induce gonadal degeneration or growth, depending on when it occurs relative 
to the animal’s day-night cycle (122). The neural mechanisms by which environ- 
mental stimuli other than light gain access to biological clocks that govern rhyth- 
mic behavior have not been delineated. 

Temperature cycles, although effective as zeitgebers for poikilotherms, are 
ineffective for mammals (161). Mammals, in common with other animals, show 
excellent temperature compensation in preserving periodicity within narrow 
limits. Although the biological clock is presumably chemical in nature, the rate 
of change of τ as temperature changes (Q10) is extremely small compared to that of
other chemical reactions (161). 
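
The comparison can be made concrete with the conventional definition of the temperature coefficient: Q10 is the ratio of the rates at two temperatures, raised to the power 10/(T2 - T1). The sketch below is illustrative only. The function name `q10` and both sets of values are assumptions, not data from (161), and the clock's rate is taken to be the reciprocal of its period.

```python
def q10(rate_low: float, rate_high: float,
        temp_low: float, temp_high: float) -> float:
    """Factor by which a rate changes for each 10 degree C rise in temperature."""
    return (rate_high / rate_low) ** (10.0 / (temp_high - temp_low))


# Hypothetical free-running periods (hr) at two temperatures; illustrative only.
tau_at_20c = 24.5
tau_at_30c = 24.0

# The clock's rate is taken as cycles per hour, i.e. 1 / tau.
clock_q10 = q10(1.0 / tau_at_20c, 1.0 / tau_at_30c, 20.0, 30.0)

# Hypothetical uncompensated reaction whose rate doubles over the same interval.
reaction_q10 = q10(1.0, 2.0, 20.0, 30.0)
```

With these hypothetical periods the clock's coefficient is about 1.02, whereas a reaction whose rate doubles over the same 10 degree interval has a coefficient of 2.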


Neural Control


Although circadian rhythms can occur in plants and in protozoa lacking nervous 
systems, the generation of these rhythms in metazoa probably is accomplished 
primarily via nervous mechanisms. Even among mammals, however, the adrenal 
gland (6) and the intestine (37) can display endogenous daily rhythms of secretion 
and contraction, respectively, in complete isolation from neural influences. In the 
intact organism phasing of these rhythms is subject to control by central neural
pacemakers. 


NONMAMMALIAN SPECIES Neural control of rhythms has been studied mainly in 
nonmammalian species, and the perspective gained from this work is useful in 
evaluating and guiding research with mammals. The optic lobes of insect brains 
have been implicated in the generation of activity rhythms and other circadian 
cycles (32, 112). The early view that a neurohumoral agent was involved in rhythm 
generation has been vigorously challenged (32) but has also received renewed 
support from the demonstration of interanimal transfer of rhythmicity in para- 
biotic cockroaches (48). The distinctive circadian rhythm of cocoon emergence 
(ecdysis) can also be transferred by brain transplants between two closely related 
moth species (192). Truman’s elegant studies showed that the timing but not the 
motor pattern of ecdysis was affected by brain transfer; this effect was accom- 
plished even when the brain was implanted in the abdomen of the pupa and 
therefore in the absence of normal neural connections. Although this suggests 
neurohumoral mediation of rhythmicity, it is important to distinguish between 
nonhumoral clock mechanisms and humoral linkage of the neural clock to motor 
systems that express rhythmical behavior. 

Circadian rhythms of neural activity have been demonstrated in a single neuron 
of the Aplysia parietovisceral ganglion (189) and in the optic nerve discharge of the 
excised Aplysia eye (94). Jacklet & Geronimo (95) cut away portions of the eye in 
vitro and observed normal circadian rhythmicity in optic nerve activity when only 
20% of the eye was left intact. When a larger portion of the eye was destroyed, the 
period of the rhythm decreased; with 2% of the eye remaining, only ultradian 
(< 24 hr) frequencies of electrical discharge were expressed. One interpretation of 
these findings is that the cells whose fibers form the optic nerve are themselves 
pacemakers independently generating ultradian rhythms and that a minimal 
number of such mutually coupled oscillators interact to produce a circadian 
rhythm of spike discharge (95). This view emphasizes circadian rhythmicity as an 
emergent property of the interaction among oscillators. The alternative proposal is 
that the reduction in the period of the Aplysia optic nerve rhythm involved the 
fortuitous destruction of crucial circadian pacemaker cells within the eye which 
synapse chemically onto optic nerve neurons (189). The issue is unresolved, but 
there is strong evidence indicating that chemical synapses are not involved and 
that oscillators in this preparation are electrotonically coupled (94). 

The avian pineal may possibly be a clock for generating activity and body 
temperature rhythms (127). Pinealectomized (pinx) sparrows display a normal 
rhythm of body temperature and perch-hopping activity when exposed to a LD 
cycle; however, in DD where normal birds generate clearcut free-running circa- 
dian rhythms, pinx birds are hyperthermic and continuously active and show no 
signs of circadian rhythmicity in these functions (26, 27). The pineal may be a 
master oscillator that hierarchically entrains other oscillators, each of which is 
responsible for circadian rhythmicity in a particular function (127); alternatively 
the pineal may be a coupling device between a master driving oscillator located 
elsewhere and other light-sensitive damped oscillators that in turn drive overt
circadian rhythms (127). 


MAMMALS The effect of selective lesions of the mammalian nervous system on 
rhythmicity has most often been studied only under entrained conditions. Any 
alteration in the temporal pattern of a single physiological or behavioral param- 
eter studied in a LD cycle is subject to a variety of interpretations and cannot be 
used to localize a clock within the destroyed brain region. The changes that have 
been described after brain lesions include a temporary reversal in the phasing of 
the peak in corticosterone release in dogs without olfactory bulbs (7), temporary 
elimination of the rat’s corticosterone rhythm after transection of the fornix (109), 
and elimination of this rhythm after basal hypothalamic isolation (74). Lesions of 
the ventromedial or dorsomedial hypothalamus reduce the normal day-night 
differences in feeding (18, 23), while lesions of the lateral hypothalamus accen- 
tuate the nocturnal feeding rhythms of rats (99); medial septal lesions exaggerate 
the apparent nocturnality of activity rhythms of hamsters (87) and the drinking 
rhythms of rats, and increase the rate at which rat drinking rhythms phase shift 
after a change in the LD cycle (Rusak & Zucker, unpublished observations, 1972). 
Superior cervical ganglionectomy retards the rate of entrainment of feeding be- 
havior in rats (20), and lesions interrupting the primary optic tracts of hamsters 
slow the phase shifting of the locomotor activity rhythm (206a). 

The problems inherent in such studies are exemplified by one performed in our 
laboratory (see 206a); lesions of the raphe nuclei of rats increased overall wheel 
running activity and greatly reduced the nocturnality of this rhythm. However, 
when tested in constant dim light these rats generated normal free-running 
rhythms with τs of approximately 25 hours, as shown by power spectral analysis.
Raphe lesions clearly increased the level of activity but did not eliminate the 
endogenous rhythmic fluctuations around the higher mean level. Such results
emphasize the need for sufficient time series (24, 25) and adequate statistical 
analysis of data obtained under proper experimental conditions before one may 
conclude that any neural, chemical, or other manipulation has “eliminated” the 
rhythmicity of a behavioral system. It should be emphasized that it is far easier to 
affect motor systems involved in expression of rhythmicity than the clock mech- 
anism itself. 
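
The distinction between the overall level of activity and the rhythm superimposed on it can be illustrated numerically. The following is a minimal sketch, not the analysis used in (see 206a): it builds two synthetic activity records with a 25 hr period, one with a raised mean, and finds the trial period with the greatest spectral power in each. All numerical values and the function names `periodogram_power` and `dominant_period` are assumptions made for illustration.

```python
import math


def periodogram_power(series, period_hr, sample_interval_hr=1.0):
    """Spectral power of a mean-removed series at one trial period."""
    mean = sum(series) / len(series)
    omega = 2.0 * math.pi * sample_interval_hr / period_hr
    cos_sum = sum((x - mean) * math.cos(omega * i) for i, x in enumerate(series))
    sin_sum = sum((x - mean) * math.sin(omega * i) for i, x in enumerate(series))
    return (cos_sum ** 2 + sin_sum ** 2) / len(series)


def dominant_period(series, trial_periods_hr):
    """Trial period (hr) with the greatest spectral power."""
    return max(trial_periods_hr, key=lambda p: periodogram_power(series, p))


hours = range(25 * 20)                                   # 20 cycles, hourly samples
rhythm = [math.cos(2.0 * math.pi * h / 25.0) for h in hours]
control = [10.0 + 5.0 * r for r in rhythm]               # hypothetical control record
lesioned = [40.0 + 5.0 * r for r in rhythm]              # higher mean, same oscillation

trial_periods = [20.0 + 0.5 * k for k in range(21)]      # 20.0 to 30.0 hr
tau_control = dominant_period(control, trial_periods)
tau_lesioned = dominant_period(lesioned, trial_periods)
```

Both synthetic records yield the same dominant period even though their mean levels differ fourfold, which is the sense in which a change in activity level need not imply a change in the underlying rhythm.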

Richter has reported a complete and convincing loss of circadian rhythmicity of 
eating, drinking, and wheel running activity in blinded rats with hypothalamic 
lesions of unspecified size and location (171). More recently, suprachiasmatic 
(SCN) lesions have been reported to eliminate rhythmicity in pooled mean levels 
of plasma corticosterone sampled at different times during a LD cycle (132). 
Individual records of wheel running activity and drinking recorded in LD cycles 
and in constant light showed an elimination of circadian rhythmicity after SCN 
lesions (186). Our recent studies of hamsters with SCN lesions suggest a complex 
role for this brain region in the production, coordination, and entrainment of 
behavioral rhythms (see 206a). Critchlow (47) had earlier implicated the SCN in 
the control of estrous cyclicity; his observations have been extended in more 
recent work which also suggests that the SCN is involved in the control of daily
surges in luteinizing hormone and in the timing of ovulation in several species (38, 
76). Whether the SCN serves as a master clock, a central coupler of rhythms, or as 
one complex oscillator in a multioscillator system is unknown. The adjacent 
medial preoptic area (97) and the lateral hypothalamus (176) both show regular 
changes in spontaneous electrical activity during lengthy recording sessions; it has 
not been established that these cycles represent true circadian rhythmicity nor that 
the rhythmic activity originates independently at each recording site. 

The importance of the hypothalamus in the regulation of rhythmicity is sup- 
ported by human clinical data; patients with hypothalamic damage show a re- 
versal of normal patterns of electrolyte excretion (107). Another report (146) cites 
a loss of normal body temperature rhythms after an increase in third ventricular 
pressure which presumably interferes with the function of midline hypothalamic 
nuclei. 


Chemical Control 


CHEMICAL BASIS OF CLOCKS In multicellular organisms the output of a circadian 
oscillator may be the product of interactions among a large population of cells. 
Other cells may couple the oscillator to the external environment or to effectors 
that express the observable circadian rhythm. Chemicals may operate at any or all 
of these levels of organization. As a consequence, pharmacological or chemical 
modification of circadian rhythmicity does not necessarily imply that the oscillator 
itself has been affected. 

The period (τ) of biological clocks reflects the basic process involved in time
measurement and is remarkably intractable to a wide range of pharmacological 
agents, including inhibitors of macromolecular synthesis, anesthetics, hallucino- 
gens, poisons, and other drugs traditionally used in pharmacological research (49, 
140, 154, 170). The few agents that have affected τ have done so only within narrow
limits, and these successes have not led to a clear understanding of the chemical 
nature of biological clocks. 

Dilute solutions of ethyl alcohol (ETOH) lengthen the endogenous tidal rhythm 
of the isopod Excirolana chiltoni (58); 0.5% ETOH increases τ by an average of
approximately one hour and dosage dependence is apparently linear up to 1%.
Reliable but considerably smaller lengthening of τ of the hamster locomotor
rhythm occurs when blind animals are chronically ingesting 20% ETOH (206a). 

Deuterium oxide (D2O), also known as heavy water, is the only chemical agent
that consistently affects the period of many free-running circadian oscillators. It
lengthens τ in plants and in a variety of unicellular and multicellular animals
(154). In the mouse τ of the locomotor rhythm is increased in proportion to the
concentration of D2O in the water supply; a maximum increase of 7.4% was
obtained with a 30% solution. In a LD cycle phase lags of entrainment are
frequently recorded, and in some cases the activity rhythm “breaks away” from 
the illumination cycle and free-runs (54). Pittendrigh et al (154) have summarized 
some of the effects of D2O on biological systems. Partial deuteration slows virtually
every biological process and rate depression is explainable by many different
potential causes; they postulate that D2O produces its effects on circadian
rhythms by diminishing effective cell temperature. Some preliminary evidence 
obtained with the Drosophila emergence rhythm supports this view. 
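
The proportionality reported for the mouse can be written out as simple arithmetic. This is a minimal sketch, assuming that lengthening is strictly linear in concentration up to the 30% solution and that the baseline period is 23.5 hr; the baseline value and the function name `tau_with_d2o` are hypothetical.

```python
MAX_INCREASE_PCT = 7.4     # maximum lengthening of tau reported for the mouse
MAX_D2O_PCT = 30.0         # D2O concentration at which that maximum was obtained


def tau_with_d2o(baseline_tau_hr: float, d2o_pct: float) -> float:
    """Estimate tau under D2O, assuming strict proportionality up to 30%."""
    if not 0.0 <= d2o_pct <= MAX_D2O_PCT:
        raise ValueError("concentration outside the range covered by the report")
    fractional_increase = (MAX_INCREASE_PCT / 100.0) * (d2o_pct / MAX_D2O_PCT)
    return baseline_tau_hr * (1.0 + fractional_increase)


baseline = 23.5                          # hypothetical free-running period, hr
tau_30 = tau_with_d2o(baseline, 30.0)    # about 25.2 hr
tau_10 = tau_with_d2o(baseline, 10.0)    # intermediate lengthening
```

Under these assumptions a 7.4% increase amounts to roughly 1.7 hr per cycle.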

Chemical manipulation of unicellular and in vitro systems will not be reviewed 
here except to cite the frequently stated generalization that the intracellular 
production of a circadian periodicity may depend on the transmembrane potential 
and on the permeability of specific ionic channels (59, 140, 189). Alcohol, D2O,
and other chemicals (e.g. lithium) which lengthen the periods of various clocks 
have all been described as increasing the stability of biological membranes. A 
recently developed membrane model of the circadian clock (140) suggests that 
timing is the result of a feedback system involving transmembrane ion gradients 
and protein-dependent ion transport channels. This model accounts for temperature
compensation of τ on the basis of the temperature adaptation of the membrane
lipid bilayer. The rate of ion diffusion and of horizontal movement of the
proteins responsible for ion transport would both be compensated for temperature 
change by this mechanism. The fact that K+ pulses (59) and valinomycin, which 
causes an accumulation of K+ (37), can produce phase shifts similar to those 
produced by L pulses suggests a crucial role for K+ in the production of circadian
rhythmicity and its entrainment by light (140). 

The existence of a general homeostasis of τ in the face of all potential perturbations
occurring within cells (153) might account for the transitory nature and
small magnitude of drug induced changes in circadian systems. The variation in τ
effected by all changes in light, temperature, and chemical stimulation is small, 
usually of the order of 5% and very rarely exceeding 10% (152). As noted by 
Pittendrigh (152, 153), the apparent homeostasis of τ is a major barrier to evaluating
the quantitative effects of any agent used to disturb the circadian oscillator
and complicates assessment of the chemical nature of biological clocks. 


CHEMICAL INFLUENCES ON RHYTHM AMPLITUDE The degree to which rhythms in 
brain chemistry affect amplitude of behavioral rhythms remains to be established. 
Discrete portions of the mammalian nervous system and the brain as a whole 
display circadian fluctuations in 5-hydroxytryptamine (5-HT), norepinephrine 
(NE), and possibly in dopamine (DA). The amplitude of these rhythms and their 
phase relations to the illumination cycle are species-specific (85, 133). 

Depletion of 5-HT which is produced by injecting parachlorophenylalanine 
temporarily reduces the nocturnal feeding, drinking, and activity rhythms of rats; 
the amplitude of these rhythms recovers to control levels within one week (31, 66, 
206a). Similar effects are observed after 6-OH DA treatments that markedly 
reduce brain NE; the nocturnal activity rhythm is attenuated but not entirely 
eliminated two to three weeks after treatment (85) and normal three to four weeks 
later (182). The effects of these drugs on the free-running periods of these behav- 
iors were not reported. 

Noradrenergic mechanisms appear to link endogenous rhythms with the cycle 
of environmental illumination. The β-adrenergic blocking agent l-propranolol
prevents the dark-triggered increase in rat pineal enzyme activity (53); NE depletion
also interferes with entrainment of body temperature in a monkey (Cebus
albifrons) (203). 

The 24 hr periodicity of eating behavior observed in rats maintained in LD 
10:14 disappeared when amphetamine was added to the water supply (30). In a 
similar study rats ingesting amphetamine equivalent to 20 mg/kg of body weight 
increased their wheel running activity throughout the 24 hr period so that en- 
trainment to the LD cycle seemed to disappear (49). The hyperactivity produced 
by amphetamine also obscured the free-running rhythm of rats maintained in 
constant light. The authors conclude that amphetamine disrupted the circadian 
activity rhythm, but they provide no quantitative estimate of τ; from visual
inspection of their records we conclude that circadian rhythmicity may have
persisted under the drug condition.

Removal of endocrine organs reduces the amplitude of various rhythms in- 
cluding those of body temperature (65), locomotor activity (170), and food and 
water intake (187, 188). The effects are presumably dependent on chemical- 
hormonal changes consequent to gland extirpation. None of these procedures 
appears to affect the periodicity of the underlying behavioral rhythms (160, 170), 
but see (37, p. 57) for a discussion of adrenal effects on physiological periodicities. 

This brief survey illustrates that it is far easier to alter the amplitude of bio- 
logical rhythms than to affect their periodicity. The specific chemical substrates 
underlying zeitgeber perception or coupling of oscillators to effector systems are 
unknown, and the possibility exists that coupling within the mammalian neural
clock and between it and other systems may be electrotonic rather than chemical 
(cf 95). Electrotonic coupling has been described in several mammalian neural 
structures, including the visual system (22a). 


DRUG EFFECTS ON BEHAVIOR Responsiveness to drugs fluctuates with the phase 
of the organism’s circadian cycle; the timing of crests and troughs of susceptibility 
can also vary from one drug to the next and depends on the function and species 
studied. 

Mortality of rats after a single injection of amphetamine at different times of 
day reached a peak of 78% at 0600 hr and a trough of 7% at 0300 hr; these animals 
were maintained on a LD 12:12 cycle with light onset at 0600 hr (175).
Fluctuations of a similar nature have been noted for a wide array of physiological 
parameters and for various classes of drugs. These findings are of importance in 
chemical therapeutics where it has often been tacitly assumed that animals are 
steady state systems whose sensitivity to drugs is invariant across time of day. 

An exhaustive review of research in this area (162) and our subsequent litera- 
ture search indicate that biological time of day has been manipulated as an 
independent variable in but a few psychopharmacological studies. This is unfor- 
tunate since drug effects on behavior are almost certainly as subject to modifica- 
tion by endogenous rhythms as the more traditionally investigated physiological 
parameters. Insufficient recognition of this point may account for seemingly dis- 
crepant findings (cf 117, 173). Biological time of day is a decisive determinant of 
the effects of NE on food intake; when applied directly to the lateral hypothalamus
of rats during the L phase of a LD 12:12 cycle, NE increases food intake; in
the dark phase NE depresses food consumption (117). This effect may depend on 
circadian fluctuations in the activity and sensitivity of lateral hypothalamic neu- 
rons (176). 

The dose-response curve for increasing locomotor activity with amphetamine is
steeper in the L than the D part of the illumination cycle (60); sleeping time of rats 
on a LD 12:12 cycle is a function of the hour of barbiturate administration (162). 
These findings suggest the importance of controlling for or better yet manipulating 
biological time of day as a variable in psychopharmacological research. The 
meager data currently available suggest that the new findings and perspectives 
gained from such an analysis will be well worth the effort. 


SEASONAL RHYTHMS 


Photoperiodism 


Homeothermic species in temperate and arctic climates must adapt to seasonal 
changes in temperature, predator pressure, and the availability of food and pro- 
tective cover. In doing so they undergo hibernation (135), estivation, migration, 
seasonal and opportunistic breeding, and delayed uterine implantation (174, 194). 
Social reorganization, daily torpor, and changes in fur quality and coloration are 
also among the adaptations observed in mammals on a seasonal basis (96, 184, 
185). 

Physiological and behavioral responses must often be initiated well in advance 
of the seasonal conditions they are designed to meet. Hibernating mammals store 
food or body fat long before snow cover and cold make feeding impossible and 
hibernation essential for survival (194, p. 378). Birds show precise timing of 
premigratory fattening and development of functional reproductive organs in 
advance of their arrival at distant breeding grounds (62). Mechanisms that rely 
exclusively on simple responsiveness to current conditions are inadequate to 
account for these phenomena; animals clearly use information about time of year
to anticipate critical conditions. Most species use the stable and regular annual 
cycle of change in daylength (photoperiod) to time seasonal events. Photoperiodic 
time measurement (PTM) is evident in the initiation of seasonal reproduction, but 
it also mediates other seasonal cycles. An understanding of the formal models 
proposed to account for PTM is essential to any consideration of research on 
seasonal rhythms. 


MODELS OF PHOTOPERIODIC TIME MEASUREMENT (PTM) 


(a) The hourglass model The mechanism whereby an organism may directly 
measure the duration of a L or D period by accumulating some endproduct during 
one phase and not the other has been compared to an hourglass. This type of 
mechanism has been confirmed in a few insects (155) but apparently is not 
applicable to mammalian PTM.


(b) Bünning’s hypothesis This states that an endogenous circadian oscillator
mediates PTM. In its role as zeitgeber light first entrains a circadian photosensitivity
rhythm; the second function of light is induction of the photoperiodic response.
This occurs if and only if light extends into the photoinducible (sensitive)
sponse. This occurs if and only if light extends into the photoinducible (sensitive) 
phase of the circadian cycle. If it is assumed that dawn sets a photosensitive phase 
to occur 14 to 15 hr later, then light will fall in the inducible phase only when 
days are long (in excess of 14 hr). In this way a short period of light coinciding 
with a critical photosensitive phase can induce the seasonal changes associated 
with long days; otherwise the animal will behave as if under short-day conditions.
Animals need not experience light continuously for effective PTM; this is an 
attractive feature in accounting for the excellent PTM observed in fossorial animals
who may be exposed to daylight only near dawn and dusk each day (37).
Photoperiods of any given duration occur twice each year and PTM depends on 
the animal distinguishing 14 hr of light in the spring from 14 hr of light in the fall; 
many animals show a photorefractory period such that photoinduction by long 
days (spring) takes place only if these days are preceded by an extended period of 
short days (winter) (37). The specific form of Bünning’s hypothesis described 
above relies on the coincidence of an internal state (photosensitivity) with an 
external condition (light) and has been called an external coincidence model (151). 
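
The external coincidence form of the hypothesis can be illustrated with a minimal sketch. It assumes, as in the text, that dawn sets a photoinducible phase 14 to 15 hr later. The function names and the representation of a lighting schedule as (start, end) intervals in hours after dawn are illustrative assumptions, not part of any published model.

```python
# Minimal sketch of the external coincidence model (illustrative only).
# Assumption taken from the text: dawn sets a photoinducible phase 14-15 hr later.
# All times are hours after dawn within a single 24 hr cycle.

PHASE_START = 14.0  # start of the photoinducible phase, hr after dawn
PHASE_END = 15.0    # end of the photoinducible phase, hr after dawn


def overlaps_inducible_phase(light_intervals):
    """Return True if any (start, end) light interval overlaps the sensitive phase."""
    return any(start < PHASE_END and end > PHASE_START
               for start, end in light_intervals)


def long_day_response(light_intervals):
    """Induction occurs if and only if light coincides with the sensitive phase."""
    return overlaps_inducible_phase(light_intervals)


# A complete 16 hr photoperiod extends into the sensitive phase.
long_day = [(0.0, 16.0)]
# A complete 10 hr photoperiod ends before the sensitive phase begins.
short_day = [(0.0, 10.0)]
# Hypothetical "skeleton" schedule: light near dawn plus a brief flash at hour 14.5.
skeleton = [(0.0, 1.0), (14.5, 14.75)]

for name, schedule in [("long day", long_day),
                       ("short day", short_day),
                       ("skeleton", skeleton)]:
    print(name, long_day_response(schedule))
```

Under these assumptions the skeleton schedule is as inductive as the complete long day, which is the property that makes the model attractive for animals exposed to light only near dawn and dusk.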


(c) The internal coincidence model (151) This assumes that the sole function of
light is to entrain various circadian oscillators such that they bear different phase
relations to each other under different photoperiods. PTM and induction depend
on a seasonally unique phase relation of these oscillators and no direct inductive 
role is ascribed to light. Two oscillators entrained respectively to dawn and dusk 
could serve this function (152). A corollary of this model is that entrainment by 
nonlight zeitgebers could achieve the proper internal relations among oscillators 
and thereby effectively mediate “photoperiodic” time measurement in the ab- 
sence of light (151). Such studies are currently in progress with temperature 
zeitgebers, but the predicted effect has yet to be demonstrated (152). 
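
The logic of the internal coincidence model, including its corollary about nonlight zeitgebers, can be sketched as follows. The sketch assumes one oscillator locked to dawn and one to dusk, as suggested above; the critical phase difference of 14 hr and all function names are hypothetical choices made only for illustration.

```python
# Minimal sketch of an internal coincidence model (illustrative only).
# Assumptions, not from the source: one oscillator locks to dawn, the other to
# dusk, and induction requires their phase difference to exceed 14 hr.

CRITICAL_PHASE_DIFFERENCE = 14.0  # hr; hypothetical value for illustration


def phase_difference(dawn_oscillator_phase, dusk_oscillator_phase):
    """Phase lag (hr, 0-24) of the dusk oscillator behind the dawn oscillator."""
    return (dusk_oscillator_phase - dawn_oscillator_phase) % 24.0


def entrain_to_light(dawn_time, photoperiod):
    """Light only entrains: each oscillator adopts the clock time of its zeitgeber."""
    return dawn_time, (dawn_time + photoperiod) % 24.0


def is_induced(dawn_oscillator_phase, dusk_oscillator_phase):
    """Induction depends only on the internal phase relation, never on light itself."""
    difference = phase_difference(dawn_oscillator_phase, dusk_oscillator_phase)
    return difference > CRITICAL_PHASE_DIFFERENCE


print(is_induced(*entrain_to_light(6.0, 16.0)))  # long photoperiod
print(is_induced(*entrain_to_light(6.0, 10.0)))  # short photoperiod
# Corollary: any zeitgeber that sets the same two phases gives the same result.
print(is_induced(6.0, 22.0))
```

Because `is_induced` never inspects the lighting schedule, any zeitgeber able to set the two phases would serve equally well in this sketch; that is the prediction the temperature studies are designed to test.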


(d) The resonance effect Since laboratory tests of PTM frequently use exotic 
lighting schedules, one cannot conclude that the circadian system is actually 
involved in PTM merely because the schedules in which L recurs at modulo 24 hr 
intervals are most effective for induction (152). A noncircadian mechanism might 
measure light duration but produce maximum induction only when the animal’s 
physiological mechanisms are synchronized by a cycle approximating the natural 
frequency. In general an organism is assumed to function most efficiently when its 
internal oscillators have the proper phase relations and this occurs only when 
entrained to 24 hr cycles. 


TESTS OF THE MODELS Tests of Bünning’s hypothesis typically involve noninduc- 
tive entraining light periods followed by brief “flashes” of light at various intervals 
during the succeeding dark periods. The pitfalls inherent in interpreting such 
studies derive from the difficulty in distinguishing entraining from inductive 
effects of light (152). Elliott et al (57) recently attempted to circumvent this
problem by concurrently recording locomotor activity and testicular development 
of hamsters exposed to such LD sequences. When the hamsters’ activity cycles 
were stably entrained to the illumination cycles their testes showed unique daily 
phases of sensitivity to the inductive effects of light; these phases recurred at 24 hr 
intervals in accord with Bünning’s hypothesis. However, these results may also be 
amenable to an explanation in terms of light entraining a dawn oscillator one day 
and a dusk oscillator on another in such a way as to maintain an inductive internal 
phase relationship, i.e. an internal coincidence model involving only entraining 
and not inductive actions of light is still not ruled out (57). 

Results obtained with the moth Pectinophora gossypiella do not support Bün- 
ning’s hypothesis. In this species, where it is possible to separate the inductive and 
entraining properties of light, no circadian involvement in PTM was observed 
(155). Replication of this experiment in mammalian species in which circadian 
systems seem to be implicated awaits the discovery of light wavelengths and 
intensities consistent with induction but not with entrainment. 

The importance of photoperiodic time measurement and the intricate interac- 
tion among biological clocks, hormones, and behavior are most clearly illustrated 
by Meier’s elegant studies on sparrows (120, 121). Injections of corticosterone and 
prolactin induce premigratory fattening, gonadal development, and zugunruhe 
(directed migratory restlessness) only when they bear a particular phase relation to 
each other. Prolactin injected 12 hr after corticosterone produces all these effects 
and the zugunruhe is directed northward (spring mode); if prolactin is given 4 hr 
after corticosterone, only fattening occurs and zugunruhe is directed southward 
(fall mode). Prolactin injections bearing other phase relations to corticosterone are 
ineffective. 

Meier’s evidence suggests that the peak in endogenous corticosterone secretion 
is followed 4 to 8 hr later by a photoinducible phase during which light can trigger 
release of luteinizing hormone (LH). A second photoinducible phase 16-22 hr 
after the corticosterone peak permits light to stimulate release of follicle-stimulating
hormone (FSH). The maintenance of photosensitivity at these phases depends
on entrainment of the prolactin rhythm at an appropriate phase relative to
the peak in corticosterone secretion (121). These data are consistent with the 
following hypotheses: light entrains the circadian rhythms of corticosterone and 
prolactin release, with daylength determining the internal phase relations between 
these rhythms; when the proper phase relation occurs, it in turn entrains a circa- 
dian rhythm of photosensitivity during which light may act to influence pituitary 
release of LH and FSH. This results in spermiogenesis, testis growth, and other 
signs of spring migration, or in the fattening and zugunruhe typical of autumn 
migration. Stress produced by handling sparrows and pricking them with a needle 
at appropriate times of day can also phase the corticosterone rise so that pho- 
toinduction can take place (122). 

Careful tests of entrainment of the circadian system with nonlight cues are 
needed to improve our understanding of the control of seasonal breeding and 
migration. Such experiments could provide valuable information about the relative
importance of the internal and external coincidence systems already implicated
by Meier’s findings. They could also provide insights into the physiological
mechanisms whereby many internal oscillators can be entrained and phase shifted 
relative to each other and to environmental cycles. 


Circannual Rhythms 


Endogenous annual oscillations (circannual rhythms) provide a second mechan- 
ism, independent of photoperiodic time measurement, for timing of seasonal 
events. To qualify as endogenous, a circannual rhythm must persist under constant 
environmental conditions and occur year after year with a free-running period of 
approximately one year. It is hardly surprising that few studies have adequately 
satisfied this demanding criterion. 

The existence of such endogenous annual oscillations has also been controver- 
sial because closely related species differ strikingly with respect to the existence 
and general form of these rhythms. Unlike daily rhythms, annual cycles are a new 
evolutionary development of adaptive significance only to relatively long-lived 
species (63). If, as seems likely, a great diversity of mechanisms evolved indepen- 
dently to mediate annual cyclicity, then the search for unitary mechanisms and 
models based exclusively on a circadian paradigm may be inappropriate (77). 

Among the most convincing examples of circannual rhythms is the hibernation 
rhythm of blinded ground squirrels (Citellus lateralis) maintained for years in a 
constant environment at 3° C. In one individual the intervals between successive
entries into hibernation were 313, 314, and 310 days, respectively (149). Heller &
Poulson, studying other ground squirrels (Spermophilus), maintain that “regardless
of the experimental manipulations we imposed upon the animals, the free-running
period length measured from terminal arousal to terminal arousal is
generally only slightly less than a year and shows remarkably little variation” (82). 
In yet another species of ground squirrel the hibernation-weight gain rhythm 
damps out after one year and few subjects show a second cycle (136). Endogenous 
annual rhythms may also regulate antler growth in sika deer (Cervus nippon) (73),
fat deposits in woodchucks (Marmota monax) (51), water intake and body weight
in ground squirrels (149), and hibernation in chipmunks (Eutamias) (82). The
evidence that these rhythms are truly endogenous is far less compelling than for
circadian rhythms. For example, the detection of circannual rhythms is strongly
dependent on the maintenance of a particular daily LD cycle. Some species
require a LD 12:12 cycle, while others need a LD 11:13 cycle to express their
annual rhythms (77); the annual cycle of antler growth in the sika deer occurs on
LD 16:8 or LD 8:16 but not on a LD 12:12 cycle (73). 
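
The free-running character of the ground squirrel record cited above can be made concrete with a short calculation. The three intervals are those reported in the text; the use of 365.25 days as the reference year is an assumption made for the comparison.

```python
from statistics import mean

# Intervals (days) between successive entries into hibernation for one
# blinded ground squirrel, as reported in the text.
intervals_days = [313, 314, 310]

# Assumption: a reference year of 365.25 days is used for the comparison.
REFERENCE_YEAR_DAYS = 365.25

free_running_period = mean(intervals_days)                       # mean period, days
deviation_from_year = REFERENCE_YEAR_DAYS - free_running_period  # shortfall, days
spread = max(intervals_days) - min(intervals_days)               # range across cycles

print(round(free_running_period, 1), round(deviation_from_year, 1), spread)
```

The mean period is about 312 days, roughly 53 days short of a year, with a range of only 4 days across cycles. A period that is stable yet clearly different from 365 days is what would be expected of a free-running rhythm rather than one driven by an undetected annual cue.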

The peculiarities and advantages of the endogenous timers of different species 
that have such stringent and divergent requirements remain unknown. A number 
of annual rhythms may be subject to separate endogenous and exogenous con- 
trols. Part of the cycle is entirely independent of external input (endogenous 
component) and a second part requires a particular LD cycle or temperature to 
trigger or terminate some crucial event. If this exogenous component is provided, 
the annual rhythm is expressed; in its absence the endogenous process is arrested 
at the blocked phase. 

The interaction of endogenous and exogenous controls in seasonal rhythms is
clearly illustrated by Reiter’s work with golden hamsters. In these animals short 
photoperiods initiate testicular regression, culminating within 8 weeks in repro- 
ductive quiescence. After 25 weeks of continual exposure to the short photoperiod, 
the gonads undergo spontaneous recrudescence and become functionally mature 
(164). Testicular development and recrudescence proceed independently of any 
need for the long photoperiods (168) otherwise required to maintain testicular 
function, and thus appear to be endogenously controlled. However, a second cycle 
of testicular regression in short photoperiod conditions will not occur unless the 
animals first experience many weeks of long-day conditions (164). 

These findings are in good accord with field observations cited by Reiter (163). 
Short laboratory photoperiods may simulate the conditions hamsters are exposed 
to in the late fall when they enter underground burrows and hibernate in the 
absence of light until the following spring. The 20-week period of sexual dor- 
mancy induced by the short photoperiod prevents delivery of young during the 
disadvantageous fall and winter seasons; the subsequent testicular recrudescence, 
independent of light, insures that sexual competence coincides with spring emer- 
gence and that young are born during the season optimal for survival (165). The 
pineal gland has been implicated in these and other seasonal reproductive events, 
and its contributions are ably documented by Reiter (164). 


Seasonal Influences on Behavior


In addition to prominent annual rhythms in hibernation and reproduction, there 
are more subtle but nevertheless important seasonal influences on physiology and 
behavior (cf 35). One direct seasonal influence is due to the changing availability 
of foodstuffs. Rodents feeding on the protein-rich spring and summer herbage 
grow faster and reach sexual maturity earlier than those ingesting the starchy 
foods available in the autumn (78). Certain opportunistic breeders living in arid 
regions reproduce only after favorable and unpredictable events such as rainfall 
(174). 

Other seasonal influences are less direct. Goldfish perform better in learning 
tasks and are more active in winter than during the summer spawning season. 
Reducing the amount of light to which they are exposed improves summer per- 
formances (179). Under natural lighting conditions the occurrence of estrus in 
pocket mice (Perognathus penicillatus) is little influenced by the presence of males; 
it has a high incidence in spring and summer and low in autumn and winter. 
However, under constant long or short photoperiods the annual cycle is disrupted 
and the presence or absence of males becomes an important variable (144). 
Snowshoe hares (Lepus americanus) gonadectomized during their breeding season 
increase LH and FSH synthesis and release; no such increases in hormone secre- 
tion are detected when gonadectomy is performed out of the breeding season (52). 
Male Microtus are more aggressive in paired encounters when captured in their 
breeding season than at other times of year (193). 

The pattern of daily activity often changes with the seasons (194, p. 345). Three 
species of voles showed a single nocturnal activity peak during summer, but all 
gradually became diurnal with one or more activity peaks during winter (75). In 
spring, arctic deer mice (Peromyscus maniculatus) have a pronounced daily activity
rhythm which may be related to effective foraging and mating; in winter, they
undergo daily torpor and are weakly nocturnal (184). Woodchucks emerging from 
hibernation in late winter show scattered daily activity; by spring they have a 
strong afternoon activity peak, in summer a bimodal pattern, and in the fall 
scattered activity culminating in winter hibernation (33). 

Little is known about whether these seasonal cycles persist under constant 
laboratory conditions, but there is a suggestion that hamsters breed more readily 
in spring than in winter, even under unchanging light and temperature conditions 
(164). 

The particular conditions of illumination, ambient temperature, and availabil- 
ity and quality of food in laboratory studies may mimic those of a given season; 
the effects of subsequent experimental manipulations may therefore be affected 
by the induction or blockade of seasonal phenomena. To select appropriate 
laboratory conditions, one must be familiar with the ecology of the species being 
studied; such knowledge has often been lacking or ignored in psychobiological
research. 


RHYTHMS AND BEHAVIORAL FUNCTIONS 


The final sections of this review are concerned with the rhythmic aspects of specific 
behavioral phenomena. Psychologists and physiologists appreciated that behav- 
iors associated with sleep, activity, food ingestion, and reproduction had an 
intrinsic cyclicity long before they suspected that other behaviors might be similarly
organized. As a consequence, the evidence is far more satisfactory for the
former behaviors than for the processes involved in sensation, perception, and 
learning. 


Eating and Drinking 


The frequency and periodicity of mammalian eating behavior depend on the 
nature and availability of the diet eaten and may be characteristically quite 
different for carnivores, herbivores, omnivores, etc. Size and metabolic rate, type 
of digestive system, endogenous activity rhythms, and numerous environmental 
factors, viz. temperature, illumination, humidity, presence of conspecifics and 
predators, and reproductive status, also may dictate the distribution and quantity
of food consumed. The feeding frequency of smaller nocturnal mammals will also 
depend on the availability of an insulated nest that reduces metabolic demands 
imposed by changes in environmental temperature and una and that pro- 
vides a dark retreat during daytime. 

The survival of mammals and the maintenance of energy balance depend on
foraging, hunting, and often returning with food to nests or storage areas. Much of 
the observable spontaneous activity of mammals in the wild is related to feeding, 
and it probably is inappropriate to analyze one without considering the other. 
Eliminating the necessity to search for food may well be a critical factor influencing
periodicity of intake; e.g. in nature the activity cycle and feeding behavior of
an insectivorous rodent may be linked to that of the invertebrates that form part of
its diet and which themselves show marked fluctuations in daily activity (17). 

In typical laboratory studies of eating behavior the periodicity of intake is 
established while excess food is freely available at all times. A dark nest box is 
rarely provided, so the nocturnal animal must experience long periods of light or 
modify its behavior to minimize light exposure (e.g. increased sleeping during the 
L phase). The addition of a dark place in the cage stimulated a South American 
rodent (Oxymycterus rutilans) to eat a greater proportion of its food during the L 
portion of the illumination cycle (196). Albino rats given access to a constantly 
dark tunnel and burrow in addition to an open field where food and water were 
available under a LD 12:12 cycle altered the temporal pattern of their eating and 
drinking (Rusak & Block, unpublished observations, 1973). 

The study of the effects of food restriction on subsequent eating is complicated 
by the variety of feeding styles adopted by mammals. Large predatory mammals 
obtain substantial quantities of easily digestible food at irregular intervals and 
may gorge themselves during one relatively brief feeding; they are adapted to fare 
quite well without further food for days. Other mammals, such as those eating 
diets of low caloric density and high cellulose content, may be compelled to eat 
relatively continuously throughout the day and night. This would appear to be 
true of the larger herbivores, including some of the ruminants (17). Some herbiv- 
orous animals have very limited abilities to compensate for periods of food 
deprivation and only slowly regain weight lost during starvation (Silverman &
Zucker, unpublished observations, 1973). The degree of depletion induced by 
starvation and in fact the survival of rodents may depend on the time of day during 
which food deprivation occurs (138). 


ENTRAINMENT Rats with ad lib access to food and water in a bare laboratory cage 
are nocturnal feeders and drinkers. In a LD 12:12 cycle they eat 75% of their food 
and drink 85% of their water during the dark phase (206). In LD 14:10 a peak of 
eating occurs with the onset of darkness and a second peak about 6 hours later 
(191). The nocturnal drinking rhythm may be partially independent of the noc- 
turnal feeding rhythm (141), and relations between food and water intake may 
differ fundamentally in the different phases of the illumination cycle (206, 207). 

In hamsters drinking is 85% nocturnal on a LD 12:12 cycle, but only 58% of 
feeding occurs during the dark phase (207). In guinea pigs there is no obvious 
nocturnal, diurnal, or crepuscular cycle, and the animals appear to eat and drink at 
regular intervals throughout the day and night (86). Some primates appear to be 
exclusively diurnal feeders, and observations of monkeys (Macaca mulatta) in 
seminatural conditions suggest that they withdraw to trees and do very little eating 
between dusk and dawn (195). 


SHORT-TERM CYCLES There is a significant positive correlation between the rat’s
meal size and the interval to the next meal (108). This relation does not appear to 
be a general description of the rat’s feeding behavior; according to Panksepp 
(147), it holds during specific metabolic states and is a consequence of statistical
procedures of questionable validity. He proposes instead that the “free-feeding
pattern of rats is generated by an internal signal of body nutrient depletion-re- 
pletion which varies in a circadian fashion” and that the satiating capacity of 
ingested food and the depriving capacity of intermeal intervals vary across the 
day. 

This concept is supported by other findings; e.g. the effectiveness of the sup- 
pression of oral intake by intragastric loading of nutrients depends not only on the 
number of calories infused but also on the temporal distribution of infusions 
relative to the normal time of feeding (159). Factors other than total caloric intake 
thus play a role in determining satiety. Some circadian rhythms in blood amino 
acids are not directly related to ingestion of dietary proteins and are evident on the 
day of birth in human infants (64). These rhythms could be manifestations of a 
neural system that generates a circadian depletion-repletion cycle. 

Failure to simulate environmental conditions more representative of those 
experienced by animals in nature may lead one to generate reliable but invalid 
laboratory data on feeding patterns. Extrapolation from results obtained with a 
particular species, tested under a limited set of conditions, to mammalian feeding 
in general glosses over a wealth of information about species specificities that 
range from radical differences in intestinal and gastric morphology (41) to elab- 
orate food storage techniques practiced by certain mammals (194, p. 323-24). 
Consider for example that albino rats, often studied as models of human feeding, 
reingest approximately 40% of the feces they produce when studied under labor- 
atory conditions. This coprophagy shows a pronounced daily rhythm (114) that 
permits the recovery of valuable nutrients (19). The influence of this cycle of 
coprophagy on meal patterns and on the rat’s adjustment to alterations in food 
availability and in diet quality deserves experimental attention. 


Sleep and Activity 


Animal rest and activity cycles have been investigated more thoroughly than any 
other rhythmic function. Such cycles seem to be fundamental to the temporal
organization of all animal behavior and are found both in the simplest and most 
complex organisms. The relative ease with which activity can be continuously 
monitored over long intervals recommends it to the student of biological rhyth- 
micity. However, measurement of activity in laboratory settings is subject to many 
of the same problems as were enumerated in the preceding section. The availabil- 
ity and quality of food, the presence of burrows, and the use of gradual light-dark 
transitions may alter the timing of activity (17, 100, 101; Rusak & Block, unpub- 
lished observations, 1973). The distribution of activity as recorded in the labora- 
tory may differ from that in the field (102), particularly if seasonal variations 
occur.

The methods used for monitoring activity (stabilimeter, activity wheel, photo- 
cell, etc) influence the nature of the results obtained (14, 67, 169). Nevertheless, 
measures as diverse as chewing, drinking, eating, and wheel running may yield 
quite similar nocturnal patterns when studied in a single species (171). If one is 
concerned with the temporal distribution of these behaviors rather than with the 
function or topography of the response being measured, then one may be justified
in studying one endpoint (e.g. wheel running) as an index of the animal’s temporal 
organization. However, since a given manipulation (e.g. a brain lesion or a drug) 
can have different effects depending on the type of device used to measure activity 
(cf 115), it is important to determine the generality of treatment effects by studying 
a number of different responses. This is obviously necessary where one suspects a 
multiplicity of clocks for different functions. 

Researchers have recognized the importance of circadian and ultradian 
rhythms in the control of sleep. Kleitman (104) proposed that the arousal level of 
an individual varies throughout the day and is a function of an ultradian,
species-specific rest-activity cycle that persists through the sleep and waking
states. The proportion of the various sleep stages recorded depends on when in 
relation to their circadian cycle individuals first fall asleep (199). 

The function of sleep has interested many writers and may be, as some have 
suggested (181, 199), to prevent wasteful and dangerous expenditure of energy at 
nonoptimal times of day; the rest-activity cycle would thus ensure that each 
species is active only when its activity has the greatest potential survival value. The 
idea that sleep serves primarily this function rather than one of metabolic recovery 
is supported by data obtained on human (15) and hamster subjects (129); under 
free-running conditions sleep time is inversely proportional to the length of the 
preceding period of spontaneous wakefulness. This relation is predicted by a 
circadian organization of the sleep-wakefulness cycle since the total of rest and 
active periods must equal a stable τ value. The opposite is predicted by any theory
that emphasizes the recovery function of sleep. The development of a sleep “need” 
after periods of sleep deprivation (202a) does suggest that some recovery occurs 
during sleep; this may be a more recently evolved function superimposed on the 
basic timing process of the sleep-wakefulness cycle. 
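
The opposing predictions of the two views can be stated in a minimal sketch. Under circadian timing, sleep fills whatever remains of a cycle of fixed period τ; under a pure recovery view, sleep grows with the preceding wakefulness. The value of τ, the linear recovery rule, and the function names are hypothetical and serve only to show the sign of each relation.

```python
# Contrast of the two predictions about sleep length (illustrative only).
# Assumptions, not from the source: a free-running period of 25 hr and a
# linear recovery rule in which sleep is half the preceding wake time.

TAU = 25.0  # hr; hypothetical free-running period


def sleep_under_circadian_timing(wake_hr, tau=TAU):
    """Rest and activity must sum to tau, so longer wakefulness leaves less sleep."""
    return tau - wake_hr


def sleep_under_recovery(wake_hr, recovery_ratio=0.5):
    """Sleep repays wakefulness, so longer wakefulness is followed by more sleep."""
    return recovery_ratio * wake_hr


for wake in (15.0, 17.0, 19.0):
    print(wake,
          sleep_under_circadian_timing(wake),
          sleep_under_recovery(wake))
```

In this sketch the circadian rule yields a declining series of sleep durations as wakefulness lengthens and the recovery rule a rising one; the human and hamster data cited above follow the declining pattern.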

Exposing burrow-dwelling rodents to obligatory LD cycles may restrict the 
amount and distribution of the various sleep stages (68, 134). In constant darkness 
or when given access to a darkened retreat, rats may make up a chronic sleep 
“debt” incurred under standard laboratory conditions (68). These findings require 
further study and confirmation, but they emphasize the need for consideration of 
species’ ecology, particularly in relation to light exposure. 


Reproduction 


The reproductive physiology and behavior of many mammals has an intrinsic 
rhythmicity. The ovaries of polyestrous spontaneous ovulators pass through follic- 
ular and luteal phases of relatively fixed duration, and the cycle is repeated unless 
interrupted by copulation or pregnancy. The limited life span of ovulated ova 
makes it essential that mating behavior and insemination be temporally synchro- 
nized with the period during which fertilization is most likely to occur. In many 
mammals the coupling of sexual receptivity and ovulation to the same fixed 
sequence of hormonal events assures this outcome. Synchronization of ovulation 
and sexual behavior is mediated in female rats by neural control of the pituitary 
(61, 72, 204) and may possibly involve adrenal participation (116). 

The endogenous nature of the biological clock controlling the hamster estrous
cycle has been definitively established (2); in constant illumination behavioral 
estrous cycles free-run with a period greater than 4 days (a circaquadridian 
rhythm). The timer that mediates this cycle probably functions with a circadian 
rather than a circaquadridian periodicity (2). Neural pathways that promote 
preovulatory LH release do so at 24 hr intervals (103, 139), but ovulation occurs 
only on the day of the cycle that coincides with high plasma estrogen levels (139). 
Spontaneous or pharmacologically induced blockade of ovulation extends the
cycle by 24 hr rather than by an entire cycle length (2), further supporting a 
circadian organization of estrous cyclicity. Failure to establish free-running es- 
trous cyclicity in rats housed in constant light does not argue against the endog- 
enous nature of this rhythm. The albino rats used in these studies are abnormally 
sensitive to the disruptive effects of constant light (36), and the cessation of 
ovulation and onset of persistent estrus in LL speak to a desynchronization or 
damping of the various hormone rhythms that contribute to the ovulation cycle (cf 
178). 
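
The argument that a circadian timer can generate a multiday estrous cycle can be sketched as a daily gate. The sketch assumes that the neural signal for LH release recurs every 24 hr, that ovulation follows on the first day the signal coincides with high estrogen and is not blocked, and that estrogen remains high on the day after a blockade. The nominal day 4 and all names are hypothetical.

```python
# Sketch of a circadian gate producing a multiday estrous cycle (illustrative).
# Assumptions, not from the source: the LH signal recurs once per day, estrogen
# is high from a nominal day 4 onward, and a blocked day is simply skipped.

HOURS_PER_DAY = 24


def ovulation_day(estrogen_high_from_day, blocked_days=frozenset()):
    """First day on which the daily LH signal meets high estrogen and is not blocked."""
    day = estrogen_high_from_day
    while day in blocked_days:
        day += 1  # the next opportunity arrives one circadian cycle later
    return day


def cycle_length_hr(estrogen_high_from_day, blocked_days=frozenset()):
    """Cycle length in hours implied by the day of ovulation."""
    return ovulation_day(estrogen_high_from_day, blocked_days) * HOURS_PER_DAY


print(cycle_length_hr(4))                    # unblocked cycle
print(cycle_length_hr(4, blocked_days={4}))  # ovulation blocked on day 4
```

Blocking ovulation on the nominal day lengthens the cycle from 96 to 120 hr in this sketch, an extension of one day rather than one full cycle, which is the observation taken above as evidence for circadian organization.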

One unresolved major question, as noted by Everett (61), is whether the clock
for cyclic LH release and ovulation exists in both sexes. According to one view (72, 
p. 240), the inherently cyclic preoptic area regulates cyclic release of gonadotro- 
pins in female rats; in males cyclicity of this neural substrate is thought to be 
suppressed during perinatal sexual differentiation. The result is that the adult male 
or the adult female exposed to androgens during the critical period of sexual 
differentiation is acyclic. An alternative hypothesis is that the clock for cyclic LH 
release, ovulation, and cyclic behavioral receptivity is present in adults of both 
sexes; females may differ from males in the coupling of the clock to the hypo- 
physiotropic apparatus for gonadotropin secretion (cf 61) and in the feedback 
effects of steroids on the hypothalamic-pituitary axis. We propose that hormonally 
mediated changes in these functions, rather than in the clock mechanism per se, 
are responsible for the lack of cyclicity in adult male rats. This conclusion is based 
on the observations that the clock for ovulation is present on the day of birth in rats 
of both sexes (79) and on the general ineffectiveness of all chemical agents in 
changing the basic periodicity of biological clocks. 

The induction of sexual receptivity in many rodents ordinarily depends on the 
synergistic action of estrogens and progesterone (157, 204); these hormones act on 
a neural substrate to promote lordosis and other species-specific components of 
behavioral receptivity. The extent to which circadian fluctuations in the number of 
central binding sites for estrogens and progesterone contribute to the time of onset 
of receptivity is unknown. Binding capacity may remain constant on each day of 
the cycle and within any given day. Significant diurnal variation in the secretion of 
hormones could be principally responsible for the behavioral rhythm. The rhyth- 
micity of estrogen secretion is difficult to assess from the published records (93); 
however, the steady and seemingly continuous rise in estrogen secretion between 
vaginal diestrus and proestrus reduces the probable functional significance of any 
such rhythm. The diurnal variation in peripheral plasma progesterone concentrations
on each day of the estrous cycle is of adrenal rather than ovarian origin
(116), and is probably of some significance in determining the onset of behavioral
receptivity (178). 

The data referred to above are derived mainly from studies of rats and hamsters,
two spontaneously ovulating species. According to Conaway (45), this type of
ovarian cycle is of restricted occurrence among mammals; induced ovulation is the 
more widespread phenomenon. In the reflex ovulator, ovulation is induced by 
copulation and occurs a fixed number of hours after mating; this could diminish 
the significance of photoperiodic synchronization of ovulation. 

Comparatively little has been written about the synchronizing influence of 
illumination on daily cycles of copulation in male mammals. Beach & Levinson 
(21) report that male rats tested during the dark portion of a LD 12:12 cycle 
ejaculate much more frequently than when tested during the L phase. The peak in 
the male’s copulation in this species coincides with the peak in the female’s 
receptivity. Blind female rats cohabiting with blind males are not impregnated 
except when the female’s free-running period of sexual receptivity overlaps the 
active phase of the male’s free-running rest-activity cycle (171a).

Sadleir (174, p. 160) notes that the modal arrangement among mammals is that
the male has a breeding season which overlaps that of the female; in other 
mammals males are fertile and capable of mating throughout the year and their 
female counterparts may breed continuously or have discrete breeding seasons. 
The significance of the males of certain species maintaining breeding capacity 
during the female’s period of reproductive quiescence may relate to other func- 
tions of testicular hormones, as for example, in influencing aggressive behavior or 
population dynamics (174). For an extensive discussion of the role of environ- 
mental stimuli in establishing readiness to breed in mammals, the interested 
reader should consult the excellent monograph by Sadleir (174). 


Sensation 


What little information currently is available suggests that sensory systems un- 
dergo circadian fluctuations in sensitivity. In the crayfish (Procambarus) the re- 
sponses of retinal photoreceptors and of central visual interneurons to repeated 
light flashes vary across a circadian cycle that depends on the hormonally con- 
trolled, endogenously timed migration of retinal pigments (8). Mechanoreceptors 
of this species also show a daily cycle of sensitivity to stimulation. 

The responsiveness of the toad (Bufo bufo) to stimuli which mimic natural prey 
objects fluctuates seasonally. In summer toads emit more prey-catching responses 
to a light object moving against a dark background than to a dark object on a white 
background; in winter responsiveness to the light object is greatly reduced. Firing 
rates of retinal ganglion cells show parallel seasonal changes in sensitivity to these 
stimuli. This effect is apparently achieved by centrifugal control of the retina and 
accomplishes at the sensory level the behavioral changes essential to the toad’s 
annual cycle of feeding and hibernation. The decline in responsiveness to fly-like 
objects as winter approaches coincides with the cessation of feeding and the 
animal’s preparation for hibernation (61a). 

Circulating levels of adrenal corticoids modify sensory thresholds of audition, 
taste, and vision in several mammalian species (44, 84). The well-established daily 
fluctuations in adrenal corticoid secretion (cf 106) encompass a sufficiently large 
range to measurably affect sensory thresholds in human subjects (84). Increased 
nocturnal sensitivity to stimuli may also account in part for the greater incidence 
of stimulation-induced seizures in mice (177) and rats (198) during the dark phase 
of the illumination cycle. 


Learning and Memory 


The discovery of a time-sense in bees was one of the first demonstrations of a 
circadian organization of learning and memory (197). Bees can be trained to 
return to a particular site at a specific time of day; once trained they return to the 
food site at approximately 24 hr intervals. They can learn to fly to several different 
sites to selectively obtain food at different times of day but cannot be trained to 
return to a given location at long intervals that differ substantially from 24 hr (37, 
p. 160). Such temporal constraints on learning are also evident in rats restricted to 
feeding for one hour per day. When maintained on a 24 hr illumination cycle (LD 
12:12) these animals increase their activity in anticipation of the daily feeding 
(29); they do not show activity in anticipation of feedings given at 19 hr or 29 hr 
intervals, even if raised from birth on 19 and 29 hr illumination cycles (29). 

A starling was trained at a fixed time of day to obtain food in a circular arena 
from the one feeding dish among 12 that lay in the direction of the sun. When 
tested at other times of day the bird did not simply fly towards a feeder in the 
direction of the sun, but compensated for the sun’s movement and flew to a feeder 
in the same compass direction to which it had been trained. Thus the starling did 
not readily learn a simple association of food with a salient environmental stimulus 
(the sun), but rather learned a complex rule about the direction of food availability 
relative to the sun’s position and its own internal clock (89, 104a). 

Rats acquire and retain passive avoidance tasks more readily at some times of 
day than at others (50). Regardless of the actual clock time of training, perfor- 
mance of active and passive avoidance behaviors is best 24 hr after training and at 
multiples of 24 hr (48 and 72 hr). Retention is also fairly good at 12 hr multiples 
after training and very poor at intermediate times (91, 92). These effects are 
independent of the actual time of day at which original training takes place. 
Apparently, rats tested in passive and active avoidance tasks learn not only a 
spatial relation, i.e. that a particular area is safe or dangerous for them, but also 
when the area has these properties. The apparent temporal coding of such learning 
has only become evident within the past few years and adds an entirely new 
dimension to the interpretation of learning phenomena (cf 91, 92). 

The circadian involvement in learning is advantageous in that it permits bees to 
visit flowers only during the limited periods each day when the latter secrete 
nectar. For rats the temporal modulation of what is learned may serve to increase 
the plasticity of behavior. An animal need not universally avoid the place where it 
was frightened or driven off by a competitor; it can return regularly at another 
time of day when the competitor is absent and feed unmolested (cf 40). 


CONCLUSIONS 


The importance of the temporal organization of behavior is difficult to overesti- 
mate. In nature a premium is placed on animals performing responses at appro- 
priate times of the day and seasons of the year. We have tried to indicate the 
central role of biological clocks in the integration of the animal’s internal milieu 
and in the synchronization of these functions and of behavior to environmental 
periodicities. 

Biological clocks are inherited (69, 172), manifested in the adult animal inde- 
pendently of specific postnatal “learning” experiences (150), and provide the 
substrate for the inherent rhythmicity of most animal behavior. The treatment of 
living animals and their behavior as steady state systems invariant across time of 
day has been common in many branches of psychology and biology but can no 
longer be justified (162). As a research strategy, ignoring biorhythmicity may 
confound the effects of other independent variables, or at least increase the 
variance associated with any given experimental procedure. 

Biochronometry has much to offer psychologists in understanding behavior as a 
cyclic phenomenon whose periodicity serves important functions. By providing 
methods of analysis and a theoretical framework it gives us the tools to deal with 
and understand the temporal organization of behavior. 


INTRODUCTION 


This review draws upon available work in English published during the period 
1970 through 1973. The reference list was derived from two computer searches 
(Medline and Compfile), Psychological Abstracts, and the contents of relevant 
major journals for the 4 year period. The resulting yield was much too large to be 
included in this review. A more compelling argument for selectivity and evalua- 
tion stems from the state of the field itself: it is diffuse, parochially compartmen- 
talized, and lacking in comprehensive theorizing. 

There is little doubt that the mind is back in style. The words “cognition” and 
“cognitive” occur in current literature with high frequency in such disparate areas 
as memory, perception, and even conditioning. The period of review has seen the 
appearance of three new journals: Cognitive Psychology, Cognition, and Memory 
and Cognition. But the wide currency of the term has not been free of a certain 
amount of devaluation; “cognition” has so many variant meanings and connota- 
tions that the remaining common core seems to be “any activity involving CNS 
activity between its initiation and completion.” With so broad a definition there is 
little, save spinal-level reflex, excluded from the class. This will not, therefore, be a 
review of “cognition.” Rather, it is a review of “thinking” in the old-fashioned 
sense, where “thinking” is defined as “mental transformations employing symbolic 
surrogates of events, objects, and processes, or properties and relations 
thereof.” 

The theoretical framework underlying research on thinking has shifted over the 
years. There is now very little work explicitly inspired by Gestalt theory or by 
muscle-twitch behaviorism. Even work with an SR learning orientation has 
declined except in the study of concept attainment, and even here the theories are 
becoming considerably looser. Work employing computer analog models, on the 
other hand, has grown considerably. Of course, the term “computer analog” 
covers a great deal of territory from analysis of essentially perceptual aspects of 
information processing to writing out specific programs for some thinking task, to 
a general theory of production systems. Furthermore, the jargon of information 
processing is widely used throughout psychology, and probably comes as close to a 
universal language as any currently available, although it is often invoked meta- 
phorically rather than used in a meaningful sense. 

Although there are still a number of investigators working on the traditional 
problems or with the traditional techniques of problem solving and concept 
attainment, most recent work takes its theoretical impetus from developments 
outside the traditional rubric. One major influence derives from the genetic 
epistemology of J. Piaget (187) which, while it does provide a comprehensive 
general framework for the study of all aspects of thought, tends to be compart- 
mentalized for most English-speaking psychologists into developmental psychol- 
ogy. Another major source of problems and theory stems from the area of 
psycholinguistics. Finally, there is a good deal of relevant input from the field of 
psychological testing (which space limitations do not permit us to cover). Although 
workers within each of these specialized frameworks are often engaged in similar 
research, they seem unaware of each other’s efforts. One hesitates to invoke the 
tired analogy of the blind men and the elephant but it is certainly apt. In the 
interest of advancement of clear-sighted elephant exploration, the review which 
follows will be organized in terms of content area rather than theoretical orienta- 
tion. 


MONOGRAPHS AND GENERAL SURVEYS 


One indication of sustained interest in the area of thinking is the recent appear- 
ance of a number of textbook surveys which vary in breadth and depth of coverage 
and in relative weighting of theory or data. The most extensive of these in terms of 
coverage of the research literature is Johnson’s (114) restructured and updated 
revision of his popular text. Manis (153) and Gardiner (77) each range widely in a 
broad and readable overview of disparate areas. Bolton (14) also offers an eclectic 
survey, organized with respect to theory rather than with respect to research 
content. Pollio (191) focuses more narrowly upon what he calls “symbolic activi- 
ty,” while Bourne, Ekstrand & Dominowski (18) concentrate upon problem solv- 
ing, concept attainment, verbal learning, and language. 

A number of monographs, some of which will be discussed in later sections of 
this review, have appeared. Foremost among these is the long-awaited book by 
Newell & Simon (176) on human problem solving. Other works on problem 
solving with a much more applied practical focus are a series of exercises to 
develop visual thinking (157), a review of educational and industrial applications 
(44), and a computer simulation of selected aspects of thought (42). In a class by 
itself is Razran’s (197) heroic attempt to fit thought into an evolutionary perspec- 
tive. 

Three volumes which bring together scattered works have appeared: notes from 
Wertheimer’s seminars (149), Maier’s (151) collection of his own research papers, 
and a monograph in which Wason & Johnson-Laird (259) review and integrate a 
number of research studies conducted by them with their colleagues and students 
over the years. Finally, a translation of A. N. Sokolov’s book on internal speech 
and thought (240) has appeared. 


PROBLEM SOLVING 


Interest in this area appears to be declining. Much of the research on problem 
solving proceeds at a deadeningly low level of empiricism: choose a problem, any 
problem, and see what variables affect it, in what way. Traditional problems seem 
to be chosen for the same reason that mountains are climbed, because they are 
there. Except possibly among test constructors, there is almost no prior analysis of 
what aspects of problem solving are to be isolated for scrutiny, why they are of 
special interest, or how best to analyze their role in relation to that of other 
behaviors. There is insufficient concern with validity of tasks, generality of 
findings, or whether independent variables affect the thought process rather than 
the context of performance. Admittedly some of these judgments presuppose the 
existence of a theory, and good general theories of problem solving are not in 
abundant supply; even a serviceable taxonomy of problems is lacking. 


Insight Problems 


The insight problem, cherished by Gestalt theory, emphasizes the role of percep- 
tual restructuring in problem solving. Typical examples are Maier’s two-string or 
coat rack problems, Duncker’s candlestick problem, and the problems used by 
Köhler with his apes. Beck (7) reports on cooperative tool use in hamadryas 
baboon. Food placed outside the cage and out of reach could be obtained by use of 
an L-shaped stick not directly accessible to a large male who had demonstrated 
tool use prior to the start of the study. He could get the tool only through help of 
another smaller animal. After the first few trials the tool was fetched by a co- 
operative female, who, interestingly enough, never directly used the tool herself. 
Nor did any other animals in the cages appear to learn from imitation. Among 
chimpanzees, on the other hand, Menzel (159) reports that discovery that a pole 
could serve as a ladder was not only imitated but also widely generalized on a 
variety of occasions. 

Work with human Ss has been concerned with factors promoting solution. 
Three studies examined the role of incubation. Murray & Denny (166) speculated 
that the facilitatory effect of time away from the task would be a function of 
problem difficulty and/or the subject’s ability (facilitation from a 5 minute break 
should occur for harder problems and low ability Ss). They found that more low 
ability Ss solved with a 5 minute interruption followed by 15 minutes on problem 
than solved on 20 minutes of continuous effort; for high ability the reverse was 
true. It was suggested that interruption facilitated by weakening interference, but 
protocols were not analyzed for direct evidence of this. Dominowski & Jenrick 
(50) repeated the experiment with an additional independent variable (hint as to 
solution) and a different problem but did not replicate the finding. However, the 
meaning of incubation was a bit ambiguous since S was not forewarned that he 
would return to work on the problem. Fulgosi & Guilford (70) varied both 
duration of interruption and presence or absence of forewarning; the latter had no 
effect upon solutions to a divergent thinking task whereas the former did. A longer 
interruption led to more, but not necessarily better, solutions. 

Other research on insight problems has been concerned with sources of diffi- 
culty (152), effect of hints (25), relative availability of potential solutions (11, 96, 
261) and transfer from various types of training (245). Each study demonstrated 
effectiveness of the independent variables investigated except the first, which used 
2 different orders of each of 6 insight problems on a total of 544 subjects. The 
authors attribute their finding to differences among individuals as to what consti- 
tutes a source of difficulty for a given problem. 


Anagram and Other Word Problems 


As might be expected in an area as old and well established as anagram problem 
solving, there are no startling new developments. Most research is concerned with 
clarifying interactions among established variables (52, 80, 231, 257), refining 
indexes of performance (215) or indexes for its prediction (80, 231); or proposing 
possible new independent variables associated with the task (113, 139, 241) or with 
the subject (138, 144, 158). Norms for solution frequency for multisolution five- 
letter anagrams with three to seven solutions have been reported (101). 

One interesting and promising line of investigation takes variables such as 
priming or word imagery, whose effect has been explored in a verbal learning 
context, to search for similar effects on the anagram task; Jablonski & Mueller 
(113) found remarkably little effect of both these variables. A similar approach has 
been employed in exploring the effect of associationist variables upon related 
word problems such as predicting a hierarchy (51), filling in blanks (246), and 
analogy problems (82, 194) with predicted effects observed. 


Mathematical Problem Solving 


Use of mathematics problems as a vehicle for study of problem solving is quite 
common in East European literature which, with few exceptions (9, 10) is un- 
available in English. Work under this general rubric ranges from studies possessing 
some direct application to education (35, 54, 146, 192, 242) to models for limited 
aspects of basic operations (1, 12, 93, 94, 180, 181, 185, 200) to theoretically 
oriented consideration of the foundations of number (21, 81), rules (218), and 
functions (103). The models for basic computational operations attempt to predict 
the time required for S to perform addition (92), subtraction (94), or multiplica- 
tion (180) of single digit quantities m and n as a function of the relative magnitudes 
of the two quantities. Although research designed to test these models is well 
conceived and executed, it does not provide resounding support of the model; nor 
is it clear how the model would be extended to more complex problems. An 
alternative analog model for addition proposed by Restle (200), which is not 
limited to single digit value and sums, was not supported by the findings of Aiken 
(1). It is not clear why one should expect a computer analog model for these 
simple, highly overlearned operations to apply; such models might be more 
appropriately applied to mental calculation of more complex quantities (even by 
lightning calculators). 

In attempting to describe the ontogenesis of number concepts, Brainerd (21) 
and Gelman (81), working from different orientations and different reviews of 
the literature, independently reached the same conclusion that ordination is the 
first step. 


Work with Other Kinds of Problems 


Although there are a number of well-known problems whose structure has been 
analyzed and which would seem to be strong possibilities for more intensive 
investigation, there turns out to be relatively little work done with most of them, 
e.g. the tower of Hanoi. Greeno (91) has begun some systematic effort with a 
version of the missionaries and cannibal problem, and Marková (154) offers a nice 
analysis of the effect of task complexity upon choice of strategy in the game of 
NIM. A similar analysis of Mills, a higher order tic-tac-toe, has been done by 
Schiopu (219). There is also an analysis of the structure of several games along 
with introduction of a new one, HALMA (55). 


Models of Problem Solving Processes 


The diagnostic problem solving paradigm, by virtue of its simple structure and 
seeming wide generality, would seem to offer a natural starting point for con- 
struction of a descriptive model of problem solving. Moreover, there exist many 
computer programs for this function, e.g. medical diagnosis, which could provide 
a prescriptive model for comparison against human practitioners. Nevertheless, 
only one model for diagnostic problem solving has appeared since 1970 (131). 
Unquestionably the major contribution to a general theory of problem solving is 
the book by Newell & Simon (176), which deals in detail with three classes of 
problems: cryptarithmetic (solving for a code which substitutes for letters a unique 
digit equivalent), developing proofs in symbolic logic through application of 
appropriate rules, and choosing moves in chess. Each class of problems is sub- 
jected to a task analysis, the program is introduced and data for select subjects are 
presented and analyzed [as Johnson (114a) has noted, with something like Freudian 
thoroughness] in relation to the program. All subjects are educated, articulate, 
skilled adults who are instructed to think aloud while solving; their protocols, 
supplemented in some instances by eye-camera data, constitute the major data. 
The essence of the theoretical treatment is that human problem solving is to be 
understood through a description of three major determinant classes: (a) the task 
environment in which it takes place; (b) the space the solver uses to represent the 
environment, the task, and his accumulating knowledge about it; and (c) the 
program assembled for solution. Four propositions determine the relation among 
the three components: 1. Only a few gross characteristics of the information 
processing system are invariant over task and solver. 2. These characteristics are 
sufficient to determine that a task environment is represented as a problem space 
and that solution behavior occurs there. 3. The structure of the task environment 
determines the possible structures of the problem space. 4. The structure of the 
problem space determines the possible programs that can be used for problem 
solving. 


The Role of Memory in Problem Solving 


Memory has assumed such a prominent role in theories of problem solving (91, 
105), concept attainment, and thinking in general, that it is well to consider the 
subject in a bit more detail. Most theories assume that thought-in-process takes 
place in an informationally limited processing space such as short-term memory 
(STM) or William James’ “footlights of consciousness.” Other relevant informa- 
tion, knowledge, skills, etc are stored in a more capacious accessory, generally 
called long-term memory (LTM), which is in constant communication with its 
short-term counterpart for purposes of information transfer in either direction. 

The central property of many types of problems, which causes them to be 
problems, is the need to order large amounts of information so they can be dealt 
with in a manageable way. This may be achieved by development of more efficient 
processing strategies, codes, chunks, or external aids; i.e. it is not clear whether one 
is dealing with S’s memory as a knowledge state, his repertoire of solution pro- 
grams, some feature of task structure, or a combination of some or all of these. 
Consider, for example, the problem of uniquely assigning one element from each 
of several property classes to a coinciding category, as in the problem posed by 
Fisher (67) to introduce his treatment of Latin Square. Such problems are consid- 
erably simplified, if not trivialized, by resort to a recording and ordering algo- 
rithm such as the Latin or Graeco-Latin square, as has been demonstrated by 
Schwartz (221, 222). Similarly, there is an accumulating body of evidence that 
children’s utilization of information improves considerably when they are pro- 
vided with an external memory aid (53, 213). 

In contrast to the essentially task-property aspects of memory noted above are 
the subject-related aspects which differentiate the novice from the pro. These have 
to do with the quality of the memory code and/or the resultant chunk (but most 
probably the former); relevant data come from developmental studies of memory 
and from analysis of skilled chess players. Among the developmental evidence 
(136, 164, 175) the Lehman & Goodnow (136) study provides especially convinc- 
ing evidence for a shift in the nature of the code for patterned pencil taps from an 
iconic reproduction to a more abstract digit series. Still more abstract characterizations 
exist among adults in dealing with serial patterns (92, 201, 202, 232). There 
is even some evidence (239) that systematic training improves memory skills to a 
limited extent. Simon and his associates (28, 233-235) have studied the memory of 
chess players for meaningful—as contrasted with random—positions of pieces and 
find very superior memory among skilled players, which is explained in terms of 
chunking and adequately simulated (235) by a new program combining perceptual 
and learning components. 


CONCEPT IDENTIFICATION 


Space limitations prevent review of the background to recent work in concept 
identification (CI); the interested reader can consult several sources (18, 69). 


Mathematical Models of CI 


Bower & Trabasso’s (20, 252) extension of Restle’s (199) CI models provided the 
focal thrust of stochastic modeling in the 1960s. The initial Bower-Trabasso model 
(20) conceives of CI (in the single attribute paradigm) as a process of hypothesis 
testing. The S learns a concept by sampling a single hypothesis randomly from the 
set of possible attribute-value hypotheses. The hypothesis is maintained as long as 
it is successful and discarded following an error. After an error, S resamples from 
the total set of possible hypotheses (i.e. an error is a recurrent event). The model is 
simple and powerful but so implausible that it has generated a large research effort 
to modify or refute it. 
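
The sampling scheme just described can be made concrete with a small simulation. What follows is only a rough sketch under assumptions that are not part of the model as reviewed here: stimuli are taken to have binary attributes, a hypothesis is coded as an (attribute, flipped) pair, the correct rule is fixed arbitrarily, and learning is scored by a run of consecutive correct responses. The function and parameter names are hypothetical.

```python
import random


def predict(hypothesis, stimulus):
    """Response (0 or 1) that a hypothesis assigns to a stimulus."""
    attribute, flipped = hypothesis
    return stimulus[attribute] ^ flipped


def simulate_single_hypothesis(n_attributes=4, criterion=10,
                               max_trials=1000, rng=None):
    """Simulate one learner; return trial count, error trials, solved flag."""
    rng = rng or random.Random()
    # Every (attribute, flipped) pair is one candidate hypothesis.
    hypotheses = [(a, f) for a in range(n_attributes) for f in (0, 1)]
    target = hypotheses[0]            # assumed correct rule (arbitrary)
    current = rng.choice(hypotheses)  # initial random sample
    error_trials, run, trial = [], 0, 0
    while trial < max_trials and run < criterion:
        trial += 1
        stimulus = [rng.randint(0, 1) for _ in range(n_attributes)]
        if predict(current, stimulus) == predict(target, stimulus):
            run += 1                  # success: keep the hypothesis
        else:
            error_trials.append(trial)
            run = 0
            # Error: resample from the full set, with no memory of
            # hypotheses rejected earlier (the error is a recurrent event).
            current = rng.choice(hypotheses)
    return {"trials": trial, "errors": error_trials,
            "solved": run >= criterion}


if __name__ == "__main__":
    print(simulate_single_hypothesis(rng=random.Random(1)))
```

Because resampling draws from the complete set, the just-rejected hypothesis can be picked again; that memoryless step is the feature the later modifications relax.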

First, the memory assumptions were relaxed to assume limited memory con- 
straints on the resampling process (e.g. 252). A number of investigators (2, 41, 61) 
have shown that Ss exhibit some degree of local consistency, i.e. the probability of 
repeating the same response on a sequence of adjacent success trials is a mono- 
tonic function of the number of stimulus values preserved across adjacent trials. 
Such sequential effects can be predicted by a simple extension of the Bower- 
Trabasso model (41), but the prediction is improved by assuming one- or two- 
trial memory. 

Next, the single hypothesis assumption was eliminated, and Ss were assumed to 
sample a set of working hypotheses. This set of working hypotheses was allowed to 
change during sequences of successes by assuming that Ss temporarily eliminated 
hypotheses inconsistent with the reinforcement (30, 252). This assumption of 
focusing on successive positive trials was dictated by data (140) suggesting an 
increase in probability of correct response preceding the last error, a finding at 
odds with the all-or-none learning axiom of Bower & Trabasso’s original model. 
In addition, several investigators noted a decrease in latency during runs of success 
trials both before and after the trial of last error (TLE) (57, 61). An interesting 
extension of these data was provided by Fink (66), who manipulated the number 
of attribute values available to Ss and found a decrease in latency beyond the 
TLE, but only for conditions with multiple dimensions available. Fink also asked 
Ss to identify the point at which they felt they had solved the problem. Latencies 
were constant from that point on for all conditions. Perhaps Ss can be trusted to 
know what they are doing in CI experiments, a point with interesting implications 
to be considered later. 

Assumptions of focusing during runs of success trials have also been supported 
by the analysis of several dependent measures other than latency. Coltheart (38) 
and Falmagne (64) both found that S’s confidence in hypotheses increased as 
a function of successive confirmations and decreased sharply after a dis- 
confirmation. These data can be interpreted to support the Trabasso & Bower 
(252) assumption of focusing on positive trials (38). Another measure which 
suggests focusing during success runs was obtained by counting the number of 
attributes Ss choose to examine while solving a CI problem. Chatfield & Janek 
(29) and Millward & Spoehr (161) developed ingenious techniques which allow Ss 
to select only those attributes of interest on a particular trial. They find that the 
number of selected attributes (a) decreases following success trials, both before 
and after TLE; and (b) increases following error trials (29). Another source of 
evidence in favor of multiple hypothesis processing is the effect of intertrial 
interval (ITI) upon performance: longer intervals (up to an optimal level) improve 
performance (31, 62). These data are consistent with the Trabasso-Bower multiple 
hypothesis model, and can be explained by assuming that longer ITIs allow S to 
sort out his hypotheses with respect to trial outcome information. 

Several extensions of the multiple hypothesis approach involve wholist strat- 
egies in which S checks all possible hypotheses simultaneously (31, 264). Both 
models assume that S entertains a set of active hypotheses on each trial, but 
neither assumes that an error is a recurrent event. Chumbley (31) assumes that on 
each trial S partitions his functional hypotheses into two subsets consistent with 
the two possible responses, e.g. those leading to response A and those leading to 
response B. If S is successful on the current trial, his active hypothesis set is 
reduced to the consistent subset on which his choice was based. On the other hand, 
if S is incorrect, he will attempt to recover the other subset and use it to continue 
focusing. Subjects resample a new set of active hypotheses only when they fail to 
recover the consistent subset, or when the consistent subset is reduced to zero. The 
subset sampling axiom is supported by Levine’s recent finding that Ss seldom 
consider the solution dimensions prior to solving (86, 95, 141). He notes that S’s 
presolution performance on solvable problems is similar to behavior of Ss work- 
ing on insoluble problems. Failure to learn solvable problems is ascribed to subset 
sampling, excluding the solution dimension. 

An attribute elimination model proposed by Wickens & Millward (264) assumes 
S samples an active set of hypotheses, checks the consistency of all hypotheses 
on each feedback trial, and resamples to form a new active subset when the 
current active set is reduced to zero. The process does not distinguish between 
errors and success, and S is assumed to eliminate all inconsistent hypotheses on 
every feedback trial. Such a model is considerably more efficient than those 
previously proposed and probably applies only to sophisticated Ss. In fact, 
Wickens & Millward (264) gave Ss a large number of successive CI problems and 
found that the Trabasso-Bower model predicted performance on the first or 
second problem, but Ss solved subsequent problems much more quickly than 
could be predicted by models which assume that errors are recurrent events. 
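
The elimination process described above can be sketched in a few lines. This is an illustrative approximation rather than the Wickens & Millward model itself: binary attributes, the (attribute, flipped) coding of hypotheses, the fixed active-set size, and the rule that S responds from a randomly chosen active hypothesis are all assumptions introduced here, and the names are hypothetical.

```python
import random


def predict(hypothesis, stimulus):
    """Response (0 or 1) that a hypothesis assigns to a stimulus."""
    attribute, flipped = hypothesis
    return stimulus[attribute] ^ flipped


def simulate_elimination(n_attributes=4, set_size=3, n_trials=30, rng=None):
    """Run one learner; return the target and a per-trial history."""
    rng = rng or random.Random()
    hypotheses = [(a, f) for a in range(n_attributes) for f in (0, 1)]
    target = hypotheses[0]                       # assumed correct rule
    active = rng.sample(hypotheses, set_size)    # initial active set
    history = []
    for _ in range(n_trials):
        stimulus = [rng.randint(0, 1) for _ in range(n_attributes)]
        feedback = predict(target, stimulus)
        # Assumed response rule: answer from one randomly chosen
        # active hypothesis.
        response = predict(rng.choice(active), stimulus)
        # Eliminate every inconsistent hypothesis on every feedback
        # trial, whether the response was a success or an error.
        active = [h for h in active if predict(h, stimulus) == feedback]
        if not active:
            # Resample only when the active set is reduced to zero.
            active = rng.sample(hypotheses, set_size)
        history.append({"correct": response == feedback,
                        "active": list(active)})
    return target, history


if __name__ == "__main__":
    target, history = simulate_elimination(rng=random.Random(7))
    print(target, [len(step["active"]) for step in history])
```

In this sketch the correct hypothesis, once sampled, is never discarded, since it is consistent with every feedback trial; resampling therefore stops as soon as the target enters the active set.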

Falmagne (61) noted that the Trabasso-Bower model can account for a general 
decrease in latency during a run of success trials, but the decrease should be the 
same during a run of successes regardless of where the success run begins. She 
found a greater decrease in latency on success runs prior to last error than on 
the success run following the TLE. This finding led her to postulate a single 
hypothesis strength model of CI. On each trial S either makes an hypothesis or 
fails to do so, in which case S can be considered to be in a passive state. Confir- 
mations and disconfirmations influence the strength of hypotheses such that, in 
general, confirmation leads to an increase in the strength (or sampling weight) of 
all confirmed hypotheses. Similarly, disconfirmation leads to a reduction in the 
strength of all hypotheses consistent with the negative outcome. 

The notion of a passive state emphasizes a separation of learning and responding. 
On some trials S fails to form an active hypothesis and simply guesses, but this 
has no influence on learning since conditioning occurs with every reinforcement 
independently of hypothesis behavior. On every trial there is some probability that 
Ss will resample an active hypothesis, although the probability of resampling 
following a success trial is much lower than following an error. 

Falmagne (61) is able to predict the discrepancy in latency for success runs 
before and after last error and to explain a variety of confidence rating data as well 
as provide some predictions of sequential statistics (2, 38, 61, 63, 64). However, 
these data can be predicted as well by a variety of multiple hypothesis models (31, 
252, 264). 

In summary, the stochastic models of CI began as simplistic hypothesis testing 
models (20) and evolved to include greater memory of hypotheses, more sophis- 
ticated manipulation of hypothesis sets, and the possibility of learning from both 
success and error outcomes. These modifications have clearly made the stochastic 
models more plausible explanations of CI. However, the expanded models still 
have a number of serious limitations. First, while memory for hypotheses is now 
assumed, memory of instances is not, despite relatively strong evidence that Ss 
display memory of at least some prior instances (18, 104). These data suggest that 
Ss remember instances other than those in the immediate vicinity of the current 
trial. The failure to consider memory of instances (other than assumptions of local 
consistency) is probably not particularly important within the constraints of the 
experimental paradigms for single attribute CI. However, if the models are to be 
extended to more complex problem domains, it is apparent that instance memory 
must be acknowledged. 

A related problem of stochastic models is their application to a very limited class 
of CI problems; single attribute classification is hardly representative of the set of 
possible classification schemes. Although stochastic models have been extended to 
more complicated problems, the stimuli employed are easily analyzed into a small 
number of dimensions. Such stimuli and concepts are indeed rare in the real 
world. 

Finally, the goal of the various stochastic models has been to encapsulate the 
strategy of concept identification. Each model formulates axioms for a particular 
strategy or way of approaching the CI task. This approach persists in spite of the 
overwhelming data suggesting strategy variability both between and within people 
(23, 115, 161). In light of this variability, existing stochastic models could, at best, 
describe some sort of central tendency or the approach to a concept problem of 
some nonexistent average man. It is also instructive to note that the general class of 
Markov models typically used assumes underlying distributions, such as the geometric, 
which build an extremely large variance into the models. Indow & Suzuki 
(111, 112) demonstrated the impact of large variance by showing that the 
Bower-Trabasso model was able to provide an excellent fit to their data, although 
they clearly identified three subsets of Ss who used wholist focusing strategies, 
single hypothesis strategies, or unidentifiable strategies. 
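
The size of the variance built into a geometric distribution is easy to verify numerically. The sketch below assumes a plain geometric distribution of trials to solution with a hypothetical per-trial solution probability c; it is not the full Bower-Trabasso model, and the function name geometric_summary is introduced here for illustration.

```python
import math

def geometric_summary(c):
    """Mean, variance, and SD of the number of trials to solution when
    each trial solves the problem with probability c (support 1, 2, ...)."""
    mean = 1.0 / c
    variance = (1.0 - c) / (c * c)
    return mean, variance, math.sqrt(variance)
```

With c = 0.1, for example, the mean is 10 trials and the standard deviation is about 9.5 trials. A spread nearly as large as the mean leaves ample room for a single model to fit data pooled over Ss using quite different strategies.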


Models of Complex Attribute Identification 


Trabasso & Bower treat conjunctive problems as two independent subproblems, 
each solved in the same hypothesis sampling fashion as outlined in their previous 
model (20). Nahinsky (167) extended the hypothesis approach in a manner that 
parallels the modifications of single attribute models. He assumed that Ss solve a 
conjunctive problem by sampling a focus set of conjunctive hypotheses from the 
initial positive instance. On subsequent positive instances, if a value of an active 
hypothesis changes, then that hypothesis is deleted from the focus. On error trials, 
S is assumed to delete from the active set all hypotheses whose values remain the 
same. The S resamples a new focus set of hypotheses when the set of active 
hypotheses is reduced to zero. Resampling takes place from the complement of 
those hypotheses active on the trial of sampling. 

A somewhat different model of conjunctive attribute identification is provided by 
Williams (265) in an extension of the computer oriented models of Hunt and 
associates (106). Williams assumes that S selects a focus sample of stimulus 
attribute values and stores them in a short-term memory, together with the trial 
outcome. Attention is focused primarily on those aspects of the stimulus that relate 
to the current hypothesis, and storage is dependent on the strength or salience of 
the attribute. A disconfirmation generally reduces the strength of a dimension, 
except when S forgets about the irrelevance of an attribute, in which case its 
strength increases over successive trials. New hypothesis selection derives from a 
search through the focus sample in STM for the most frequently occurring attri- 
bute which partitions positive and negative instances. A second dimension con- 
sistent with the first part of the hypothesis is then selected in the same fashion. 
Hunt’s (106) work on CI simulation demonstrates that Williams’ basic approach 
can be used to account for a wide range of complex problems. Nahinsky’s model, 
on the other hand, seems implausible for problems with more than two relevant 
dimensions because the number of potential hypotheses becomes exceedingly 
large. 
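
The growth in the number of hypotheses can be illustrated with a simple count. The sketch below is a combinatorial illustration only, not a computation taken from Nahinsky: it assumes n_dims stimulus dimensions with n_values values each and conjunctions of exactly k relevant dimensions. The function names are hypothetical.

```python
from math import comb

def focus_conjunctions(n_dims, k):
    """Conjunctions of k attribute values that can be read off a single
    positive focus instance (the values are fixed by the instance)."""
    return comb(n_dims, k)

def total_conjunctions(n_dims, n_values, k):
    """All possible conjunctions of k dimensions, any value on each."""
    return comb(n_dims, k) * n_values ** k
```

For six dimensions of three values each, the total number of conjunctive hypotheses is 135 with two relevant dimensions, 540 with three, and 1215 with four.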


EMPIRICAL RESEARCH ON ATTRIBUTE IDENTIFICATION For purposes of exposition, 
recent research efforts may be roughly classified as follows: 

1. Strategies (23, 83, 84, 115, 133, 134, 160, 161, 262, 263). 

2. Stimulus variables: intradimensional variability, irrelevant dimensions, rele- 
vant redundancy, concreteness (32, 98, 120, 133, 147, 167, 168, 177, 212, 220). 

3. Trial outcome (blank, positive, negative) and number of response categories 
(2, 17, 37, 49, 100, 122, 132, 135, 142, 206, 238, 251). 

4. Transfer of concepts; reversal and nonreversal shift paradigms (3, 56, 124, 
126-128, 148, 162, 163, 205, 230, 255). 


STRATEGIES There has been some recent interest in distinguishing hypothesis vs 
associative strategies of CI. White (262, 263) obtained limited evidence suggesting 
that it is possible to train Ss to use an hypothesis-oriented strategy in a conjunctive 
problem. Subjects given unsystematic pretraining with an arbitrary paired- 
associate task evidenced incremental learning, whereas Ss trained on a rule- 
governed problem showed a stationarity of presolution responses, suggestive of 
an all-or-none, hypothesis-oriented strategy. Millward & Spoehr (161) also 
present evidence that Ss learn a variety of systematic strategies over a sequence 
of CI problems. 

Bruner and his associates (23) identified Ss who solve CI problems by attending 
to stimulus attributes (focusers) and those who test hypotheses (scanners). Within 
each of these classes two extremes have been identified as follows: 1. Focusers: (a) 
conservative (varying one attribute at a time); (b) gamblers (varying more than one 
attribute). 2. Scanners: (a) successive (one hypothesis at a time); (b) simultaneous 
(checking multiple hypotheses). Johnson (115) employed an interesting modifi- 
cation of the selection paradigm, which he calls the minimum information feed- 
back procedure, in an attempt at more objective identification of concept 
strategies. A dialogue between S and a computer was established such that S could 
select instances and request instance names or check hypotheses. There was no a 
priori solution to the problem, but feedback was provided in a way that minimized 
the elimination of potential solutions. With this procedure, Johnson found it 
possible to distinguish three basic types of solution: conservative focus, successive 
scanning, and a generalized tactical approach. The “tacticians” were characterized 
by a deliberate but unsystematic approach to the problem which typically led to a 
quick solution. Johnson was able to obtain several independent identifications of 
these three strategies by employing various multivariate clustering techniques. 

Several investigators have objected to attempts at specifying a small number of 
strategies (133, 134, 160). They note that it is possible to view the various strategies 
as a single continuum, and point out certain ambiguities associated with the 
dichotomy of focus and scanning strategies. On a problem with two values per 
dimension, for example, perfect focusing, varying one attribute at a time, is 
logically equivalent to simultaneous scanning (all possible hypotheses are parti- 
tioned into tenable and untenable sets). Focus gambling and simultaneous scan- 
ning can be mapped similarly onto each other. These apparent ambiguities led 
both Miller and Laughlin to a detailed analysis of the characteristics of perfect 
solutions under various strategies. In general, they find that in spite of the logical 
ambiguities, various strategies do lead to different efficiency of solution both in 
theory and practice, with the conservative focus being most efficient. 
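
Conservative focusing in a selection task can be sketched in a few lines. The example assumes a conjunctive concept over binary dimensions and a known positive focus instance; the function conservative_focus and the oracle is_positive are hypothetical names introduced here, and the code illustrates the idealized strategy rather than any investigator's scoring procedure.

```python
def conservative_focus(focus, is_positive):
    """Vary one attribute of a positive focus instance at a time.

    If changing a single attribute turns the instance negative, that
    attribute value is part of the concept; otherwise it is irrelevant.
    Returns a dict mapping relevant dimension -> required value.
    """
    relevant = {}
    for d, value in enumerate(focus):
        probe = list(focus)
        probe[d] = 1 - value          # binary dimensions assumed
        if not is_positive(tuple(probe)):
            relevant[d] = value
    return relevant
```

Under these assumptions the ideal focuser needs exactly one probe per dimension, regardless of how many hypotheses are logically possible, which is one way of seeing why the strategy is efficient.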

Laughlin (134) created a continuous measure of solution efficiency and focusing 
which can be applied to a variety of tasks and strategies. In a recent experiment 
(133) he found solution efficiency to be a direct function of the degree of induced 
focusing. Laughlin (135) has also extended his analysis to predict the relative 
difficulty of attribute identification under various logical rules, e.g. conjunctive, 
disjunctive, conditional, etc. On the basis of an analysis of the difficulty or “strain” 
required to use the focus strategy, he was able to predict correctly the ordering of 
difficulty for eight rule types. Giambra (84, 85) has replicated the predicted 
ordering with minor exceptions, but failed to find the correct order of difficulty 
when Ss were given nonexemplar start cards. 

Laughlin (134) treats all strategies as a continuum representing various degrees 
of focusing. Bourne (17a) points out that it is not clear that Laughlin’s index truly 
measures a focusing strategy, and in any case, the index is indistinguishable from 
problem solving efficiency. In other words, the existence of a scale which orders 
strategies does not lead directly to the conclusion that all strategies are variants of 
the same process. In fact, Johnson’s (115) ability to identify distinct groups of 
strategies with multivariate clustering techniques seems to suggest that the various 
strategies are not simply approximations to pure focusing. 


STIMULUS AND ENCODING VARIABLES Haygood, Harbert & Omlor (98) reported a 
somewhat counterintuitive finding that Ss were able to solve both uni- and bi- 
dimensional problems faster as the number of levels/dimension was increased. 
These data can be interpreted on the basis of a positive focus strategy. With such a 
strategy S attends only to the positive instance—attributes which have not changed 
value. As the number of values/dimension increases, so also does the rate of 
change on irrelevant dimensions. However, an alternative explanation is possible. 
Since the paradigm of Haygood et al employed 50% positive instances, an increase 
in the number of values also changes the relative salience of the positive value. The 
solution value must appear, on the average, on every other trial (50% positive 
ratio), but with six levels an irrelevant value would appear on only one-sixth of the 
trials. Recent experiments have failed to resolve this controversy (32, 133, 177, 
220). 

A few studies examined the effects of relevant redundant cues which typically 
lead to faster learning (147, 212). Rollings, Bethel & Deffenbacher (212) noted that 
the separation of redundant cues is an important variable in a problem which 
involves a verbalized string of dimension values, e.g. large, red, triangle. Subjects 
solved faster when the redundant cues were separated, and showed more two-cue 
solutions when the cues were adjacent. Solutions were fastest when the encoding 
order was grammatical, e.g. large, red triangle as opposed to triangle, red, large. 

Despite the overriding emphasis on dimensional analysis of stimuli, there has 
been work on effects of exemplar representation and encoding. Nahinsky (167, 
168) showed that with geometric drawings or concrete word problems Ss sample 
sets of values as mutually interacting parts, whereas with abstract word problems 
they sample dimensions independently. 


TRIAL OUTCOMES AND RESPONSE CATEGORIES Levine’s (140) blank trials pro- 
cedure for hypothesis identification has been widely employed (2, 122, 142, 
206, 238). In the blank trials procedure blocks of nonfeedback trials are inserted 
into the CI sequence. The procedure is useful only if it does not disrupt normal CI 
processing, and several studies (2, 140) have shown no effect of nonreinforced 
trials upon number of errors to solution in single attribute problems. However, 
analyses of local consistency reflect clear differences between reinforced and non- 
reinforced trials (2). Levison & Restle (142) have also shown that nonreinforced 
trials reduce the number of multiple cue solutions, but not number of errors, in 
problems with relevant redundant cues. 

An interesting postscript to the blank trials procedure is a demonstration by 
Karpf & Levine (122) that hypothesis identification with the blank trials procedure 
coincides with S’s own description of his current hypothesis. These data suggest 
little need for the blank trials procedure in single attribute tasks, but overt hypothesis 
statement can reduce errors with more complex problems (49). 

Laughlin (132) demonstrated that the completeness of feedback influences ease 
of attribute identification in a wide variety of multiattribute problems. Subjects 
solve a conjunctive problem more quickly when they are told if the stimulus 
contains both critical values (TT), only one (TF, FT), or neither (FF). They are 
fastest when provided with a full truth table specification of the stimulus. More 
recently, Bourne (17) demonstrated that the availability of positive and negative 
classification information improves performance even when full truth table in- 
formation is provided. 


TRANSFER PARADIGMS Interest continues in the special transfer problems created 
by dimensional shifts (125, 128). Explanations of the shift phenomenon fall into 
two broad classes: mediational and attentional. Anderson, Kemler & Shepp (3) 
identify four subclasses of attentional theories which derive from independent 
assumptions about selective attention (responding to a subset of the stimulus) and 
dimensional learning. They argue that the transfer shift data can be explained on 
the basis of either selective attention or simply dimensional learning. The latter 
explanation is supported (124) by evidence that Ss learn about both relevant and 
irrelevant dimensions. 

These extensions of attentional theory make it almost indiscriminable from a 
mediational point of view. However, attempts continue to contrast mediational 
and attentional theories of the overlearning reversal effect (148, 205, 255), the 
optional, and the intra- and extradimensional shift paradigms (126, 127, 230). 
These data provide little advancement in our understanding of human conceptual 
behavior, but appear as the remnants of a waning controversy. 

Modigliani (162, 163) has explored a different set of transfer problems and finds 
that Ss are quite sensitive to both irrelevant dimensions and added values on the 
relevant dimension in CI transfer situations. He notes that one must be careful in 
concept transfer tasks because it is quite possible for Ss to restrict the generality of 
their learning to a specific set of stimuli. Furthermore, his work adds support to the 
notion that Ss learn about stimulus configurations in addition to relevant dimen- 
sions. 


COMPLETE AND RULE LEARNING PARADIGMS In a rule learning task, S is provided 
with the relevant stimulus attributes and asked to discover the logical rule that the 
experimenter used to partition the stimulus domain. Considerable attention has 
been focused on the relative difficulty of rules based on the various logical con- 
nectives. There has been broad agreement that the rules can be ranked by difficulty 
in this order—affirmation, conjunction, inclusive disjunction, and conditional (19). 
The biconditional is sometimes easier, but usually more difficult than the condi- 
tional (85, 217). Peters & Denny (184) suggest that the conditional might be easier 
learned than the biconditional because it contains a smaller and more homogeneous 
nonexemplar class, i.e. in terms of a truth table classification the conditional 
treats TT, FF, and FT as positives and TF as negative; for the biconditional TT 
and FF are positive and TF and FT are negative. They suggest that the conditional 
might be easier because S can focus on the negative class and simply learn TF is 
negative. These data support such a contention by showing that the biconditional 
is easier than the conditional when Ss are induced to focus on the positive 
exemplars by the use of response labels. Gottwald (89) has shown a similar effect 
of labels in directing focus on positive or negative instances. Haygood & Bourne 
(97) proposed a multistage model of complete learning based on a relatively 
efficient truth table analysis of the problem. They conceive of S as going through 
successive stages of attribute identification, assignment of instances to a 2 x 2 
truth table, and finally inferring the rule from the outcomes assigned to each truth 
table classification. Chumbley (31) presented a similar two-stage model for con- 
junctive problems. These approaches are supported by data suggesting (a) spon- 
taneous acquisition of the truth table strategy over successive problems (16), and 
(b) improved performance after instructions on the truth table method (17, 48). 
Chumbley’s two-stage version of this type of model has also received some support 
(196, 249). 
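
The truth table partitions referred to above can be enumerated directly. The sketch below encodes four of the rules as Boolean functions of two attributes p and q and lists the positive and negative cells of the 2 x 2 table; the names RULES, CELLS, and partition are introduced here for illustration.

```python
# Each rule maps the truth values of two relevant attributes to a class.
RULES = {
    "conjunction": lambda p, q: p and q,
    "inclusive disjunction": lambda p, q: p or q,
    "conditional": lambda p, q: (not p) or q,
    "biconditional": lambda p, q: p == q,
}

# The four cells of the 2 x 2 truth table.
CELLS = [("TT", True, True), ("TF", True, False),
         ("FT", False, True), ("FF", False, False)]

def partition(rule_name):
    """Return (positive cells, negative cells) for the named rule."""
    rule = RULES[rule_name]
    positives = [name for name, p, q in CELLS if rule(p, q)]
    negatives = [name for name, p, q in CELLS if not rule(p, q)]
    return positives, negatives
```

The conditional leaves a single negative cell (TF), whereas the biconditional leaves two (TF and FT), which is the asymmetry in the nonexemplar class noted by Peters & Denny.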

The assumption of a truth table strategy as a model of CI produces at least a 
small credibility gap. Perhaps Ss are capable of such an effort, and they may even 
be induced to use the strategy, but it is certainly not typical of the untrained S. In 
general, the truth table might be more descriptive of E generating stimuli than of S 
solving a CI problem. 

The most plausible and comprehensive models of the complete CI task are those 
of Hunt (106) and Ruskin (215a). Hunt uses a simulation approach with memory 
of prior stimuli to determine potentially relevant attributes, and then computes a 
concept by creating a decision tree based on attribute-value tests. Ruskin (215a) 
uses a more associative approach to derive a class of stochastic adaptation models. 
He assumes that Ss condition and remember stimulus-response information and 
eliminate responses based on adapted dimensions. 


Critique of the CI Literature 


The recent research and theory on CI provide some useful information about 
attention, discrimination, and logical rule usage, but unfortunately they have little 
to say about the identification, use, or formation of natural concepts. There has 
been only a limited effort to expand the horizons of CI to explore natural language 
concepts (172, 216) or such domains as color and form (99, 214). But, in general, 
existent CI models fail to capture the richness inherent in human conceptual 
behavior. As Wittgenstein (266) pointed out, natural concepts such as “dog” or 
“game” cannot be formed or identified on the basis of a partitioning rule because 
they do not necessarily share a set of common attributes. 

A large part of contemporary CI research goes astray simply by choice of 
stimuli. Sampling of rules and dimension hypotheses is only likely with analyzable 
stimuli of low dimensionality. Furthermore, the dimensions in CI stimuli are 
usually uncorrelated and provide no basis for classification except as determined 
arbitrarily by the experimenter. Natural concepts, on the other hand, are usually 
based on a correlation of dimensions and a consequent clustering of exemplars. 

Again, as Wittgenstein noted, the natural concept is determined not by rule but 
by similarities which are often vague and perhaps best thought of as “family 
resemblances.” Phrased in this fashion the CI problem becomes one of judging 
relative similarity rather than identifying exemplars by rule. Work on pattern 
recognition, therefore, may be more relevant to naturalistic human conceptual 
behavior than traditional literature on CI. Of particular interest is recent work 
suggesting that Ss often react to complex stimuli as integrated entities prior to the 
full dimensional analysis proposed by conventional CI models (79, 145, 226). 

Even more pertinent to understanding CI is work on classification of patterns 
based on knowledge of prototype (102, 198). Reed (198) provided an excellent 
demonstration that Ss learn to classify schematic faces on the basis of proximity to 
class prototypes. His Ss seemed to abstract a prototype from the known exemplars 
of a category, and then judged subsequent membership on the basis of similarity 
to this prototype. Reed’s approach uses stimuli which are dimensional yet more 
natural than those typical of CI experiments, and he appears to capture at least the 
spirit of Wittgenstein’s notion of “family resemblance.” The idea of a prototype at 
the center of a loosely bounded concept provides a reasonable approach to the CI 
problem, but leaves unanswered questions of how the concept and prototype are 
formed. 
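
Classification by proximity to a prototype can be sketched as follows. The example is a minimal illustration and not Reed's own model: it assumes that each exemplar is a vector of numeric feature values, that the prototype is the feature-wise mean of the known exemplars, and that similarity is unweighted Euclidean distance. The function names are hypothetical.

```python
import math

def prototype(exemplars):
    """Feature-wise mean of a category's known exemplars."""
    n = len(exemplars)
    return tuple(sum(column) / n for column in zip(*exemplars))

def classify(item, prototypes):
    """Assign item to the category whose prototype is nearest."""
    return min(prototypes,
               key=lambda label: math.dist(item, prototypes[label]))
```

A new item is assigned to whichever category prototype lies closest, so membership is graded by similarity rather than fixed by a partitioning rule.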


LOGIC AND THOUGHT 


Logical reasoning, e.g. within the framework of the classical syllogism, became a 
subject for investigation early in research in thought. At that time it was well 
established that logical rules do not describe psychological behavior of adult 
thinkers. Recently there has been a great increase in the study of logical thought as 
narrowly defined in relation to formal logic. Motivation for this research varies 
widely from theoretically based research to concern with empirical questions such 
as how language affects logical reasoning, how people interpret logical language, 
and the psychologic of reasoning from there; some research has even been inter- 
preted as evidence that any child is capable of formal logic—or that no adult is. 


Transitive Inference 


The term refers to a logical filling in of gaps in information concerning pairs of 
elements A x B + B x C sharing a common relation, x. Where x is a relation defined 
on a polar continuum (e.g. fat-thin, fast-slow) the inference A x C is always 
valid (for noncontinuum relations such as, “is the nephew of,” the conclusion is 
not necessarily valid). This is a logical operation of great power and generality 
which has been identified as a process of central importance for a number of theoretical 
viewpoints. 
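
The logical operation itself, as distinct from any psychological account of how Ss perform it, can be written as a closure over ordered pairs. The sketch assumes a relation that is in fact transitive (such as an ordering on a polar continuum); the function name transitive_closure is introduced here for illustration.

```python
def transitive_closure(premises):
    """Fill in every pair (a, c) implied by pairs (a, b) and (b, c).

    premises is a collection of ordered pairs (a, b) meaning "a x b".
    Valid only for transitive relations, e.g. "is fatter than".
    """
    known = set(premises)
    changed = True
    while changed:
        changed = False
        for (a, b) in list(known):
            for (c, d) in list(known):
                if b == c and (a, d) not in known:
                    known.add((a, d))
                    changed = True
    return known
```

Given the premises ("A", "B") and ("B", "C"), the function adds the single inferred pair ("A", "C").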

Hunter (107), in a developmental study, established that some forms of 
presentation (of what he and others have called the three-term series problem) are 
much easier to solve than others. A linear ordering of elements and relations, as in 
the canonical form above, is easy; variations such as B < A + B > C are more difficult, 
in Hunter’s view, because they require transformation to the canonical form 
by (a) conversion of the polarity of the relation, or (b) reordering of premises. 
If these assumptions are correct, then solution time should increase with an 
increase in the number of required transformations. This is what Hunter found, 
along with a suggestion that conversion is a simpler transformation than reorder- 
ing. De Soto and his associates (45) suggested that one solves problems of this form 
by constructing a concrete spatial representation, “spatial paralogic,” of the prem- 
ises which orders the terms from most to least x. Huttenlocher (108) elaborated 
that position by adding a consideration of grammatical transformations involved 
in processing the sentences; in essence, she combined features of Hunter, who 
focused on translation of information as presented, and De Soto et al, who focused 
on the representation of that information. When Ss are provided with paper and 
pencil and allowed to write the names of the elements as the premises are given, 
they do write names in a spatial order (118, 193). The Huttenlocher explanation 
was challenged by Clark (33), who proposed a linguistic interpretation in- 
corporating a new variable, lexical marking, which assumes that the unmarked 
form of a polar continuum is simpler, thereby creating an asymmetry in the 
continuum. His alternative model leads to the same predictions of change in 
reaction time for various arrangements of the elements and premises as do the 
spatial models, except for one form never used in previous research—the negative 
equative (A is not as x as B). There has been some subsequent heated interchange 
(34, 109) as well as some relevant evidence (123) interpreted by its authors as 
support for Clark, but no additional light appears to have been shed on the nature 
of transitive inference. Johnson-Laird (117) proposes a classification of three-term 
series problem explanations into operational (107), image (45), and linguistic (33), 
and suggests that which strategy S uses is a function of his stage of experience with 
the problem; with experience one shifts from flexible general procedures to more 
specialized heuristics: from image to linguistic processing (no evidence is cited in 
support of this view). 

Because transitive inference is a keystone in the development of concrete oper- 
ations structures according to Piagetian theory, the age at which it is first clearly 
evidenced has become subject to scrutiny. Results of two experiments suggest that 
young children fail to reach a conclusion, not because they are incapable of 
inference, but because premise information is not stored in memory; when infor- 
mation availability is assured by providing an external memory aid (213) or by 
having the child overlearn component relations (24), performance is considerably 
improved. However, in light of a variety of recent evidence (253, 254, 267-269), 
much more limited conclusions appear to be in order: (a) children do not neces- 
sarily use logical inference if there is another, simpler, route to solution available 
(254); (b) preoperational children do not use inferential logic; (c) a number of 
procedural variables have a significant effect upon performance. Among the 
relevant variables are: ordering of elements and nature of relations included in the 
premises (254, 269), availability of consistent labels for the endpoints of the 
continuum (253), form of the questions and the nature of feedback (254, 267). The 
effect of some of these variables could have been predicted from the three-term 
series literature, and, in fact, there is evidence (254) that spatial representation is 
used and that response time is an inverse function of element separation. 


Syllogistic Reasoning 


As might be expected in a well-established research area, most recent work 
examines fine details in relation to earlier formulations (8, 207, 210, 211); 
examines effects of some individual difference variables (169), or, in the new 
psycholinguistic mode, the effect of surface structure on processing time (143); 
and, most interesting of the lot, the logic of logical errors (27). There are few 
surprises in the results: negation, passive voice, and other relatively standard 
grammatical transforms do slow processing (143) without necessarily introducing 
error (211); older and more poorly educated Ss show more errors of reasoning, 
especially with emotionally toned meaningful content (169). In syllogistic rea- 
soning one must attend to form rather than content of the premises, and bright 
subjects do (4, 211). Where content changes produce big changes in interpretation, 
it may mean that S is poorly instructed or ignorant of logical conventions such as 
the status of amphigory (244). Ceraso & Provitera (27, p. 400) assert that “errors in 
reasoning are not the result of illogical or alogical processes, but are the result of an 
incomplete analysis of the logical structure of the syllogism,” an argument they 
support by comparing performance on traditional and modified forms of the same 
syllogism. Additional support for this view is provided by evidence on interpre- 
tation of quantifiers (173). 


Comprehension of Logical Symbols and Terms 


Much of the research on logical thought focuses upon specific structural features 
and terms. Two important classes of terms are quantifiers (modifying terms such as 
“all,” “some,” “no,” etc) and connectives (terms describing the relation among 
major terms such as “and,” “or,” “but not,” “if then”). 

Work on quantifiers has addressed a variety of issues not readily amenable to 
integration and summary: the derivation of quantity from identity (90); 
processing-time differences (119); effect of lexical context of quantifiers upon their 
interpretation (116, 203, 204); and analysis of the errors in interpretation of “pure” 
(i.e. content-free) A, E, I, and O propositions and their compounds by identifying 
Venn diagram alternatives described by each statement (173). The last study, 
which included subjects from grade 6 through college, showed that “some” is too 
narrowly interpreted at all ages. Although interpretation broadens with age, even 
adults interpret “some” to exclude the possibility of “all”—an error which would 
not be corrected by the definition of “some” (e.g. “at least one”) provided at the 
outset of studies of syllogistic reasoning. 

There is only one study of understanding of logical symbols (75), but it is a 
major one which tested 300 children in grades 1 to 6 after 5 consecutive days of 
prior training on matching of instance and a symbolic expression. They also 
studied the effect of negated connectives (which was too difficult for most children 
in grades 5 and 6) and form of material. Element negation and form of connective 
were not sources of differential difficulty. Three clear ability levels emerged with 
matching of truth value, regardless of connective, characterizing the lowest level. 
This form of “matching” has also been reported elsewhere (60, 179). 

Although the preceding study found no evidence that “or” is more difficult than 
“and,” it is undoubtedly because S was to test one instance against a symbolic 
expression; other research with a similar procedure (179) or a more demanding 
response measure (174) shows clearly that “and” is the easiest connective. The first 
study (179) examined four conjunctives (and, both-and, but, neither-nor), two 
disjunctives (or, either-or), the conditional (if-then), and the biconditional (if and 
only if), in a test where subjects from grade 2-college were to indicate whether a 
statement characterized the presented instance picture. As in (75) the youngest Ss 
responded solely on the basis of match or mismatch, with little apparent attention 
to the form of the connective. Disjunctives are harder than conjunctives, while 
conditional and biconditional are not clearly differentiated from each other. In a 
partial replication (174) of an extensive Japanese study, subjects in grades 
3-college were to respond to 16 descriptive statements by encircling all instances 
described; verbal or pictorial instances were provided. While even young children 
correctly interpret “and,” “or” is incompletely interpreted until late adolescence. 
A follow-up (171) to identify sources of difficulty in comprehending the disjunc- 
tive revealed that “or” is rarely employed in spontaneous description but that set 
union is described in terms of conjunctions of intersects. Still another use of the 
connectives “and” and “or” arises in the pseudoimperative, comprehension of 
which has been examined by Springston & Clark (243). 


Conditional Reasoning 


The general structure of the conditional reasoning problem is a major premise, if p 
then q, followed by a premise posing one of the quantities or its negation (p, p̄, q, 
q̄) and a conclusion to be drawn or tested (q, q̄, p, p̄). The first (affirming the 
antecedent or modus ponens) and the last (denying the consequent or modus 
tollens) of the four possible arrangements are valid inferences whereas the middle 
two are invalid. Several experimenters (208, 209, 247) have presented 8 or 12 
versions of these four basic forms to college students to judge the correctness of the 
conclusion and (in one case, 247) to rate confidence in the judgment. Although the 
relative difficulty of the four forms obtained in the two studies does not coincide, 
both find modus ponens to be easy for the majority of students. A later experiment 
(248) designed to determine the truth table for Ss who judged consistently showed 
that only 3% of 73 Ss concurred with logicians, while 64% treated the conditional 
as a biconditional; however, a change in the nature of response required of the 
subjects produced a substantial lowering of that value. A developmental study 
(248a) of children in grades 3-12 showed a consistent broadening of interpretation 
with age among children who interpreted the conditional in a consistent fashion: 
at first there was simple matching and interpretation as conjunction, later as 
biconditional, and finally as a conditional according to the rules of logic. 
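
The difference between the logician's conditional and the biconditional reading can be checked mechanically. The following is a minimal sketch, not taken from the studies reviewed; the dictionary and function names are illustrative assumptions. It enumerates the truth table for p and q and calls an argument form valid when its conclusion holds in every case in which both premises are true.

```python
from itertools import product

# Two readings of the major premise "if p then q".
# The labels are illustrative, not terms from the cited studies.
INTERPRETATIONS = {
    "conditional": lambda p, q: (not p) or q,
    "biconditional": lambda p, q: p == q,
}

# Each form: (minor premise, conclusion), both as functions of (p, q).
FORMS = {
    "modus ponens": (lambda p, q: p, lambda p, q: q),
    "denying the antecedent": (lambda p, q: not p, lambda p, q: not q),
    "affirming the consequent": (lambda p, q: q, lambda p, q: p),
    "modus tollens": (lambda p, q: not q, lambda p, q: not p),
}


def is_valid(major, minor, conclusion):
    """True if the conclusion holds in every row where both premises hold."""
    return all(
        conclusion(p, q)
        for p, q in product([True, False], repeat=2)
        if major(p, q) and minor(p, q)
    )


def valid_forms(interpretation):
    """Names of the argument forms that are valid under the given reading."""
    major = INTERPRETATIONS[interpretation]
    return [
        name
        for name, (minor, conclusion) in FORMS.items()
        if is_valid(major, minor, conclusion)
    ]


if __name__ == "__main__":
    for reading in INTERPRETATIONS:
        print(reading, "->", valid_forms(reading))
```

Under the conditional reading only modus ponens and modus tollens come out valid, whereas under the biconditional reading all four forms do, so a subject who treats the conditional as a biconditional will consistently accept the two "invalid" middle forms.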

Evans (58, 60) varied the values of p and q in the first premise, “if p then q” 
(which in his studies were expressed as arrangements of colored forms or pairings 
of letters and numbers). In one study (60), for each rule S was to provide all 
instance combinations which would verify the rule and all which would refute it. 
In another (58), S was to judge the validity of the antecedent (or its denial) when 
the consequent was (a) denied and (b) affirmed. In the second study, rule form had 
a marked effect: rules giving the antecedent in negative form, “if not p then... ,” 
were much more productive of erroneous inference. The negative antecedent rule 
form was also more productive of incorrect initial selection of an infirming in- 
stance than was the positive antecedent form, but it had no effect upon selection of 
initial confirming instance (which was universally a choice of TT instance—e.g. pq̄ 
as verification of “if p then not q”; unfortunately, the proportion of Ss supplying 
all 3 noninfirming instances as verification was not reported). 

Undoubtedly the most dramatic evidence on the vicissitudes of conditional 
reasoning stems from a task designed to be an analog of practical hypothesis 
testing: Wason’s (258) four-card problem. To test a hypothesized causal relation of 
the form “p implies q,” one looks for the only combination of p and q which can 
disprove the implication, i.e. p·q̄. In Wason’s (258) experimental analog, S is given 
four cards marked E, K, 4, 7, and told that each card has a letter on one side and a 
number on the other. His task is to name those cards, and only those cards, which 
need to be turned over in order to determine whether the following rule is true or 
false: If a card has a vowel on one side, then it has an even number on the other side. 
The correct answer is to choose E and 7 (pq̄). In four replications of the experiment 
only 5 of 128 subjects gave this response; the majority chose E and 4, or even E 
alone. Given previously reviewed data on efficient use of disconfirmatory evidence (64) 
and accurate reasoning with modus tollens for most adults, this is a puzzling 
finding. A number of studies have been run to clarify the causes of this strange 
behavior (150, 224, 259) or to offer an explanation for it (59); most of this research 
is summarized in Wason & Johnson-Laird (259). Seemingly small changes in 
material have large effects upon performance, with “realistic” material productive 
of many more correct solutions. 
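
The logic of the four-card problem can be spelled out in a few lines. This is a minimal sketch under stated assumptions: the helper names are hypothetical, and the possible hidden faces are reduced to one vowel, one consonant, one even number, and one odd number. A card needs to be turned only if some possible hidden face would combine a vowel with an odd number, the one combination that falsifies the rule.

```python
VOWELS = set("AEIOU")


def violates(letter, number):
    """The rule 'vowel -> even number' fails only for a vowel with an odd number."""
    return letter in VOWELS and number % 2 == 1


def must_turn(visible, letters="EK", numbers=(4, 7)):
    """True if some possible hidden face could falsify the rule.

    `letters` and `numbers` are assumed stand-ins for the possible hidden faces.
    """
    if visible.isalpha():
        return any(violates(visible, n) for n in numbers)
    return any(violates(letter, int(visible)) for letter in letters)


def cards_to_turn(cards=("E", "K", "4", "7")):
    """Cards whose hidden face matters for testing the rule."""
    return [card for card in cards if must_turn(card)]


if __name__ == "__main__":
    print(cards_to_turn())
```

The 4 card is never selected because neither a vowel nor a consonant on its reverse can falsify the rule, which is why the popular choice of E and 4 is uninformative.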


Formal Operations 


The final stage of human intellectual development, which Piaget calls formal 
operations, is characterized by the capacity for formal logical reasoning in the form 
of hypothetico-deductive thought. Assertions that logical inference occurs well 
before adolescence—possibly among preschoolers—have been pretty well demol- 
ished (e.g. 225), and there is considerable evidence that adults do not display 
logical inference under a variety of conditions. There is even a reanalysis (26) of 
one of the Inhelder & Piaget protocols which found only 8 of the 16 binary 
operations. Piaget (188) has acknowledged the weight of evidence in a reevalua- 
tion of formal operations which proposes three alternative views, of which he 
prefers the last: that all normal adults eventually attain this stage but with individ- 
ual variation dependent upon such factors as aptitudes and professional special- 
ization. Two weaker formulations of formal operations have been proposed 
which do not emphasize logic (87, 170). 


Generalization and Abstraction 


Peel (182) makes the provocative suggestion that the qualitative shift in 
conceptualization which takes place in adolescence should be described in terms 
of two separable processes: generalization (on a continuum from particular to 
general) and abstraction (on a continuum from concrete to abstract), which can be 
independently assessed (183). He further notes “large degrees of generalization 
can be reached without recourse to abstraction, but high levels of abstraction can 
only be attained through the medium of increasing degrees of generalization” 
(182, p. 21). The distinction serves to focus a number of issues for research in 
thinking. First, although early concept studies (103a) were primarily concerned 
with understanding the processes of generalization and abstraction—and clinical 
applications (86a) still are—recent work has tended to lose sight of this central 
issue and to treat concept identification as a special case of discrimination. Second, 
there is considerable sloppiness in the use of the two terms which makes for 
needless confusion. For example, many stimulus materials described as “abstract” 
in no way reflect a distilling of some central essence of a conceptual class—rather, 
they are arbitrary and either content-free or artificially divorced from meaning. 
The Wason four-card problem is a convenient example which, in fact, serves to 
induce neither abstraction nor generalization (although there are probably indi- 
vidual differences in fixing a cut-off on the particular-to-general continuum, it is 
likely that N = 4 instances is well below any adult’s threshold for generalization). 
On the other hand, generality and abstraction can serve as very productive 
continua for the manipulation of independent variables, as Collis (35, 36) has shown 
in his studies of mathematical reasoning. 

Finally, there is a large body of cross-cultural evidence suggesting that the 
difficulties primitive peoples experience with “standard” laboratory tasks focus 
around just these issues. Scribner & Cole (223) note that the reasoning of un- 
schooled individuals may well be sound, but it is particularistic in comparison 
with the reasoning of schooled groups. If one encounters only a limited number of 
instances, perhaps there is no need for summarizing generalizations; on the other 
hand, what characterizes the “expert” is experience of many instances. Further, in 
comparing characteristics of formal schooling as contrasted with informal educa- 
tion, they note that informal training occurs in its appropriate daily-life context 
whereas school learning is detached from the context of appropriate occurrence. 
Perhaps a certain degree of detachment is prerequisite for abstraction. In any 
event, abstract thinking may well be a product of formal education. As educa- 
tional techniques for developing abstracting skills are perfected and institution- 
alized, therefore, one might expect not only to develop “corrective” training for 
special subgroups, but also to observe a long-term secular trend toward better 
performance on standard tests (130, 250). 


LANGUAGE AND THOUGHT 


There are many interrelated controversies associated with the question of the 
relation of language to thought. Most of these issues require a good deal of 
clarification before they can be subjected to experimental investigation; none of 
them is resolved by the very selective sampling reviewed in this section. That the 
whole subject is still very much alive is indicated by the volume of research which 
has appeared: the International Journal of Psychology devoted the entire first issue 
of 1971 to the subject with seven essays covering a broad range of topics. 

Early behaviorists held that thought was nothing more than speech at a covert 
level and sought physiological evidence of muscular activity accompanying 
thought and, presumably, serving as the physiological vehicle for it. Most theorists 
today accept thought and language as separate and separable processes: there is 
language without thought and thought without language. Concern now tends to 
focus more on the ontogeny of the two processes or on the relative influence of the 
two processes upon each other. 


Inner Speech 


Early believers that thought is a covert process of talking to oneself recorded 
activity of the speech musculature during thinking and control activity for evi- 
dence in support of the assumption. McGuigan (155, 156) is perhaps the most 
articulate supporter of the conclusion that muscular accompaniments of thought 
have “some internal communication function, that they serve in the transmission 
of information.” 

Extensive experimental support for the position is provided by the work of A.N. 
Sokolov (240), who cites two types of evidence: (a) recording of EMG activity of 
the tongue and lower lip during mental arithmetic and silent reading; the phasic 
bursts assumed to reflect implicit articulation of words are intensified with increase 
in problem difficulty and oral presentation; (b) interference with understanding of 
text or taped material by concomitant performance of reciting a poem, counting to 
oneself, or even biting one’s tongue. All these activities are assumed to interfere 
with comprehension through prevention of, or interference with, implicit speech; 
central interference (counting) is more inhibiting than peripheral (tongue-biting), 
and the effect of both diminishes over repetition of the task. 


Private Speech 


Vygotsky has proposed that talking to oneself assumes a function in promoting 
thought during the preschool years, at which age the child frequently speaks aloud 
when a difficult problem is encountered. With advancing age and socialization this 
dialogue with the self becomes interiorized. Empirical support for that position is 
now provided by several studies of Beaudichon (5, 6) with 300 5-10 year olds, 90 
5-9 year olds, and 34 5-7 year olds performing a variety of tasks. Her evidence 
shows that although total spontaneous verbalization decreases with age, the pro- 
portion of all child verbalization serving a regulatory function increases with age. 
Amount of such private speech increases with task difficulty (and failure, 46) 
especially among 5-year-old children. Among younger children enforced silence is 
associated with poor performance, while required thinking aloud improves per- 
formance. 

Conrad (40) provides a most elegant demonstration of increasing use of private 
speech—in this instance, as a verbal code to facilitate memorization—over the MA 
range 3-11 years. A short-term memory task requiring matching of a picture series 
was used with two sets of pictures: (a) objects whose names sound alike, e.g. rat, 
cat, bat, bag; (b) different-named objects, e.g. girl, spoon, train. Earlier work with 
adults had shown much poorer short-term recall for homophones, presumably 
because of interference generated during rehearsal. Below age 5 performance with 
the two sets was equivalent; thereafter there was increasing superiority for the 
nonhomophone deck. 


Verbal Mediation 


Consistent evidence from the studies above, showing that after age 5 the child 
begins to generate language in the service of thought, augments earlier findings of 
Kendler & Kendler (129) on the appearance of verbal mediators at about that age. 
In recent work the Kendlers use a 3-phase paradigm with an optional shift and test 
for the determinants thereof. Pooling the data of a number of studies shows con- 
sistent increase in optional reversal shift with increasing age. A parallel ontog- 
eny has also been demonstrated (126) for optional intradimensional shift (which 
is assumed to reflect verbal mediation) as contrasted with extradimensional shift 
(which does not); ID shift increases sharply in relative frequency of occurrence 
from age 4½-8. Moreover, Ss choosing ID shifts learned the initial discrimination 
faster and were brighter than their age-mates choosing ED shifts. Similar evidence 
that brighter children choose the verbally mediated shift was reported for reversal 
shift for children in kindergarten but not second grade (22). Unlike most investi- 
gations in this area, this one used two tasks, involving different stimulus dimen- 
sions, administered a week apart to determine reliability of shift behavior (50% of 
grade K and 59% of grade 2 children were consistent). Kajita (121) proposes an 
alternative explanation of the ontogeny of reversal shift in terms of logic under- 
lying the discrimination rather than use of language labels. Blank (13) offers 
perhaps the strongest evidence that most investigators have failed to find evidence 
for verbal mediation because they have used the wrong stimulus material: spatial 
rather than temporal. On the other hand, Olson (176a) offers the provocative 
suggestion that S’s language can’t affect his thought: what can S say to himself that 
he doesn’t already know? 

Flavell (68) has proposed two explanations for why a child may fail to display 
verbally mediated behavior: he fails to produce the mediator (production 
deficiency), or he produces a mediator but it exerts no control over his behavior 
(control deficiency). T. S. Kendler (128) attempted to tease the two processes apart 
and to demonstrate the ontogeny of each: her results show a rapid decline of 
control deficiency during the early school years and a more gradual decline of 
production deficiency, with decrease not beginning much before grade 4. 

A popular strategy in evaluating the role of language upon thought has been to 
employ subjects lacking a traditional form of language, e.g. the deaf. Work with the 
deaf continues (e.g. 13, 39, 73, 256), but not without question as to its interpreta- 
tion (15, 74). Now that chimpanzees have been trained in the use of a symbol 
system for communication, new and untapped resources for investigation of the 
relation of language and thought are opened. The Gardners’ (78) observation that 
their chimpanzee Washoe would on occasion climb into a tree and sit signing to 
herself is suggestive evidence in this regard. 


Imagery 


The Piagetian treatment of thought, through its distinction between figurative and 
operative features, tends to de-emphasize the role of language in the development 
of thought on two counts: (a) there are other figurative or representational systems 
besides language; and (b) the basic structure of thought is described in terms of 
operative properties whose development is relatively independent of figurative 
detail. Furth (71, 72) provides an illuminating discussion of the figurative- 
operative distinction. 

After having devoted a lifetime of study to the ontogeny of operative thought, 
Piaget has turned to closer scrutiny of perception (186) and, in collaboration with 
Inhelder, of mental images (189) and memory (190) for additional light on the 
ontogeny and function of these figurative systems. The results in both the latter 
volumes are interpreted as supporting the conclusion that operational develop- 
ments precede and determine figurative ones rather than the reverse [thus, by 
implication, .. . “it is not language which will explain the operations of the human 
mind, but cognitive operations that will provide the basis for explaining language” 
(65, p. 39)]. Space does not permit a review of all the evidence relevant to this issue; 
the remainder of this section will be devoted to a consideration of the major 
representational alternative to language—imagery. 

Although “imagery” is returning to popularity in psychology (195), it is clear 
that the referents of the term are as many and varied as the users. Attempts to 
converge on imagery through the identification of a factor or factors from a battery 
of mental tests range from total failure (47, 237) to only moderate success (137, 
178). An approach analogous to the behavioristic attempt to relate covert 
speech to thought has been somewhat more successful: pupillary dilation and 
EMG correlates (236) as well as alpha suppression in EEG (76) have been related 
to performance on an imagery task. Dale (43) reports interference with short-term 
retention of position information from tasks requiring vision presented during 
delay. 

By far the most compelling evidence of mental transformation of a visual image 
or images comes from the work of Shepard and his associates (227-229). Two 
studies show a very orderly linear function relating reaction time required for 
performing an instructed transformation and the number of rotations or foldings 
required for (a) matching a variable with a standard figure (229); (b) determining 
whether arrows on two sides will meet when a flat six-sided figure is folded into a 
cube (228). Questioning of subjects in the second study showed that all of them use 
visual or kinesthetic imagery in performance of the task. An inverse relation of 
reaction time in judgment of “larger” and log estimated difference in size was 
obtained for judging the larger of two animals when presented with their names 
(165). Weber & Castleman (260) made another attempt at measuring the time to 
imagine. 

Huttenlocher & Presson (110) distinguish two classes of mental rotation prob- 
lem: perspective problems (e.g. Piaget’s mountain problem), where the array is 
stationary and the observer moves with respect to it; and rotation problems, where 
the observer is fixed and must move the array with respect to himself. Perspective 
problems are considerably more difficult than rotation problems because, the 
authors suggest, they involve an additional element of relating the self to the 
coordination of observer and array. This reasoning suggests that perspective 
problems should be less difficult if the child is allowed to move, and that is true. 

With respect to cognitive maps of real space, a study (159a) of food retrieval by 
5-7 year old chimps of 5-18 food items scattered randomly while the animal was 
carried behind E, or penned in a detention cage, shows that animals (a) start on the 
side with more food, (b) choose a least distance path, and (c) rarely return to 
spots emptied earlier. 


In this contribution, I have tried to evade the Scylla of a critical but too specific 
review and the Charybdis of a general but too uncritical review by partly giving 
way to both dangers. The first half of the allocated space is used for a critical 
discussion on the origin of the low pitch of complex sounds. Referring to the 
proceedings of the 1969 Driebergen symposium (162), largely devoted to that 
question, Harris (87) wrote in his 1972 review on audition that “probably at least 3 
years more should elapse before another review would be useful.” Even though 
this period has barely expired, the present reviewer nonetheless feels that there is 
an urgent need for a reconsideration of the pitch problem in the light of recent 
experimental evidence. If von Békésy (246) was right in stating that, to some extent, 
our so-called theories of hearing are actually only theories of pitch perception, this 
may be a sufficient argument to devote half of this review to it. The second half is 
used for a general survey of other major topics of auditory psychophysics. 


THE LOW PITCH OF COMPLEX SOUNDS 


Complex sounds as produced by musical instruments and the vocal cords are 
characterized by a pitch equal to the pitch of the fundamental. According to their 
experimental conditions, or their preferred explanations, authors have referred to 
this pitch as residue pitch, periodicity pitch, time-separation pitch, repetition 
pitch, virtual pitch, etc. I will adopt in this review the term low pitch as a more 
neutral denotation for the pitch of complex sounds. 


1 With a few exceptions, only papers published in the years 1970 through 1973 are 
considered in this review. Articles not directly referring to the low pitch of complex sounds 
but related to subjects brought up in the discussion on the origin of low pitch are included in 
that part of the review. Limited space obliged me to leave out such subjects as speech 
perception, auditory memory, signal detection theory, temporal threshold shifts, and studies 
on animals. For the same reason, papers published in the Proceedings of the Seventh 
International Congress on Acoustics, Budapest 1971, were excluded. 


Temporal Versus Spectral Base of Low Pitch 


It is now well accepted that the low pitch of complex sounds is not derived from the 
fundamental. Von Békésy’s (247) arguments in favor of the fundamental were 
answered many years ago by the undeniable demonstration that the low pitch does 
not disappear if the fundamental is completely masked by a band of noise. 
Apparently the harmonics are significant in one way or another. Experiments with 
amplitude-modulated signals (44, 196) demonstrated that for a constant modula- 
tion frequency, the low pitch depends upon the frequency of the carrier. This 
proves that the low pitch is based neither on the difference tone as a distortion 
product nor on the envelope of the waveform. De Boer (44) showed that the 
low-pitch behavior of these inharmonic tone complexes is in line with two alter- 
native hypotheses: (a) low pitch is derived from the time intervals between peaks in 
the fine structure near successive crests of the waveform at some place along the 
cochlear partition; (b) low pitch is derived from the power spectrum and corre- 
sponds to the best estimate of.the common divisor of the frequencies of the 
partials. A prerequisite of the first hypothesis is that the relevant harmonics are not 
resolved by the ear’s frequency-analyzing mechanism; a prerequisite of the second 
is that they are resolved. The first hypothesis assumes that low pitch is based on 
temporal peripheral cues, the second that it is based on spectral peripheral cues. 
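
Hypothesis (b) can be illustrated with a small calculation. The sketch below is not De Boer's procedure; it simply assumes that each partial is assigned the nearest harmonic number of a starting guess and that the "best estimate of the common divisor" is an equal-weight least-squares fit. The function names and the example frequencies are illustrative assumptions.

```python
def am_partials(carrier, modulation):
    """Three partials of an amplitude-modulated tone (frequencies in Hz)."""
    return [carrier - modulation, carrier, carrier + modulation]


def estimate_low_pitch(partials, f0_guess):
    """Least-squares 'common divisor' of the partial frequencies (in Hz).

    Assumption: each partial gets the nearest harmonic number of f0_guess.
    """
    ranks = [max(1, round(f / f0_guess)) for f in partials]
    # Minimizing sum((f_i - n_i * f0) ** 2) over f0 has this closed form.
    numerator = sum(n * f for n, f in zip(ranks, partials))
    denominator = sum(n * n for n in ranks)
    return numerator / denominator


if __name__ == "__main__":
    for carrier in (2000.0, 2030.0):
        partials = am_partials(carrier, 200.0)
        print(carrier, round(estimate_low_pitch(partials, 200.0), 2))
```

For a carrier of 2000 Hz modulated at 200 Hz the estimate is 200 Hz; moving the carrier to 2030 Hz with the modulation frequency unchanged raises it to about 203 Hz, although the difference frequency and the envelope period stay the same.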

De Boer considered the time intervals in the fine structure as the most probable 
basis of low pitch, and many investigators shared this view, explaining their 
experimental data almost exclusively in terms of temporal cues (see 162 and also 
chapters by A. M. Small and J. O. Nordmark in 236). De Boer did not exclude the 
possibility that the alternative hypothesis may hold as a complementary mechan- 
ism for widely separated partials, but for some time no one followed him in this 
respect. It was only recently that a few authors have suggested comparable views. 
We shall see in the following sections that in the light of recent experimental 
evidence the neglected rather than the favored hypothesis should get most of our 
attention. 


Effects of Phase on Low Pitch 


If pitch is related to the fine structure of the compound waveform of two or more 
harmonics, one would expect pitch to be sensitive to the phase relation among 
these harmonics. Ritsma & Engel’s (178) experiments with quasi frequency- 
modulated signals supported this supposition. Wightman (268), using iden- 
tical stimuli, could not confirm this result. On the contrary, he found that the 
low pitch was virtually invariant for shifting the phase relation from quasi fre- 
quency modulation to amplitude modulation. Patterson (148) came to the same 
conclusion by investigating the low pitch of inharmonic tone complexes, consist- 
ing of 6 or 12 partials, both for cosine and random phase conditions. This result 
was also verified by Wightman (268). The conclusion that the low pitch of complex 
sounds does not depend upon the phase relations among the harmonics is just 
what we should expect in the light of room acoustics, because the phase relations 
are completely random in reverberant sound fields (163). 


Harmonics Dominant in Low-Pitch Extraction 


In 1967, both Plomp and Ritsma published data relevant for deciding which 
harmonics are most effective in the extraction of low pitch. Ritsma (176) found 
that for fundamental frequencies between 100 and 400 Hz, pitch is based on the 
frequency region corresponding to the 3rd, 4th, and 5th harmonics. Reconsidering 
Plomp’s (161) data, we can determine that the rank numbers of the most effective 
harmonics gradually decrease for higher fundamental frequencies, with the result 
that beyond ca 1400 Hz, pitch is determined by the fundamental itself. Bilsen (12), 
presenting his subjects with series of 2 or 3 simultaneous harmonics of an (absent) 
fundamental of 50-400 Hz against a background of white noise, found that the 
noise level needed to mask a low-pitch jump had a maximum for tone complexes 
around the 4th harmonic. Apparently this level is an appropriate measure of pitch 
perceptibility. For harmonics around the 8th and higher, the noise level required 
to mask the low pitch of the 3-component stimuli was influenced by their phases. A 
subsequent study (29a) showed that this phase effect is caused by the occurrence of 
strong combination tones of the type f₁ - k(f₂ - f₁), to be discussed below. 

The finding that the low harmonics up to the 5th are most effective in the 
sensation of low pitch suggests that frequency analysis may be essential for its 
origin. Soderquist (203) corroborated earlier observations (160) that subjects are 
able to hear the individual partials of a complex of tones if their frequency 
separation roughly exceeds critical bandwidth. This means that the first 5 to 8 
harmonics of a complex tone are resolved. By means of masking experiments, 
Zwicker & Fastl (283) verified that the critical band does not develop in time but 
reflects a constant filter property of the ear (see also 208). Duifhuis (47-49) studied 
an interesting consequence of the limited resolving power of the hearing organ by 
using periodic pulses with spectra of which a single harmonic was eliminated. He 
found (47) that for up to 100 pulses per sec, this absent harmonic is audible as a 
pure tone if its rank number is above about 25. This seeming paradox could be 
explained by the transient type response of the basilar membrane to short pulses 
implied by limited frequency resolution. This response was scanned with a train of 
sine wave bursts following the individual pulses, of which the threshold as a 
function of time delay was measured (48). The resulting estimate of the quality 
factor Q of the auditory filter was Q > 7, to be compared with Q ≈ 6 on the basis 
of critical bandwidth data. 
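
The link between the quality factor and the number of resolved harmonics can be made explicit with a rough calculation. This is a sketch under simplifying assumptions not stated in the source: the auditory filter is taken to have a constant Q over the frequency range of interest, and a harmonic is counted as resolved when the spacing between adjacent harmonics exceeds the filter bandwidth at that harmonic.

```latex
\Delta f(f) = \frac{f}{Q}, \qquad
\text{harmonic } n \text{ of } f_0 \text{ is resolved if }
f_0 > \Delta f(n f_0) = \frac{n f_0}{Q}
\;\Longleftrightarrow\; n < Q .
```

With Q between about 6 and 7, this puts the limit at roughly the 5th to 7th harmonic, which is of the same order as the 5 to 8 resolved harmonics quoted above.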

The hypothesis that low pitch is based upon peaks in the power spectrum 
implies that a complex sound consisting exclusively of high harmonics not re- 
solved by the ear should not have a low pitch. Terhardt (227) noted that the 
maximal center frequency of 3 harmonics for which low pitch can be observed 
[Ritsma’s “existence region” (175)] is approximately equal to the minimal fre- 
quency separation of two tones required to distinguish their individual pitches. 
Using the same subjects for the investigation of both limits, Moore (128) checked 
this statement. Even at very low sensation levels, applied in order to avoid any 
effect of combination tones, he found that under some conditions the tone complex 
had a well-defined low pitch when none of the individual harmonics was 
separately audible. He considered this to be positive evidence that the time 
intervals in the fine structure do play an essential role in pitch perception. Moore 
observed also that the introduction of noise in the frequency range below two-tone 
complexes reduced the rank number of the highest harmonic that can be distin- 
guished from the 10th to approximately the 7th as found earlier for multitone 
stimuli (160). 


Role of Combination Tones 


Even at low sound-pressure levels, two pure tones with frequencies f₁ and f₂ may 
give rise to aural combination tones with frequencies f₁ - k(f₂ - f₁). Before discussing 
their significance for low-pitch perception, I first consider recent research on 
general properties of combination tones. Helle (93) and Smoorenburg (202) 
investigated the level and phase of the cubic combination tone 2f₁ - f₂ (k = 1) as a 
function of the levels of the primary tones, presenting additional evidence that the 
combination tone cannot be explained by a nonlinear frequency-independent 
term in the transfer function; combination tones with frequencies f₁ - k(f₂ - f₁) are 
much more intense for small than for large values of f₂/f₁. Smoorenburg (201) 
found that the lower limit of the audibility region of 2f₁ - f₂ depends primarily on 
the level of the lower partial. This limit appeared to be nearly the same for k = 1, 
2,..., 6. He demonstrated also that these combination tones may play a mis- 
leading part in masking experiments. This question was explored very extensively 
by Greenwood (75). He was able to explain the notch in the masking audiograms 
of pure tones or narrow bands of noise, appearing on the high-frequency side of 
the masker, by the detection of cubic combination tones. The same process that 
creates combination tones also creates combination bands. This similarity was 
confirmed by further data (76-78) from which the levels of combination bands, 
given by 2f₁ - f₂, f₂ - f₁, and 2(f₂ - f₁), were estimated on the basis of the masked 
thresholds. Combination bands generated by mixtures of pure tones and narrow 
noise bands appeared to vary in level as a function of stimulus parameters in a way 
very similar to the variations in the level of combination tones produced by two 
pure tones, as measured by cancellation techniques. Zwicker & Fastl (284) pre- 
sented additional experimental evidence for the equivalence of the two methods. 
Hall (83) compared the behavior of 2f1 - f2 and f2 - f1 and found that both the
amplitude and phase angle of the f2 - f1 cancellation tone decrease with an increase
of f1 or f2/f1. For f1 : f2 = 2 : 3, the two combination tones coincide in frequency
(= f1/2), and their interference manifests itself in a phase effect (84).
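
The frequency relations in this paragraph are simple arithmetic, and a short sketch may help in checking them. The function name and the example frequencies are illustrative assumptions, not values taken from the studies cited.

```python
def combination_tone(f1, f2, k):
    """Frequency of the combination tone f1 - k(f2 - f1), assuming f1 < f2."""
    return f1 - k * (f2 - f1)

# Hypothetical primaries chosen so that f1 : f2 = 2 : 3.
f1, f2 = 1000, 1500

cubic = combination_tone(f1, f2, 1)   # k = 1 gives 2*f1 - f2
difference = f2 - f1                  # simple difference tone f2 - f1

# For this ratio both products fall at half the lower primary.
print(cubic, difference, f1 / 2)
```

With any other frequency ratio the two products separate, which is why the phase interaction is specific to the 2 : 3 case.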

Another much disputed product of the ear’s nonlinearity is the aural harmonic. 
By measuring the phase dependence of the threshold of a 2000-Hz probe tone 
presented simultaneously with a 1000-Hz tone, Clack, Erdreich & Knighton (36; 
see also 55) estimated the level of the aural harmonic at 2000 Hz. Their results 
indicate that its level is more than 50 dB below the level of the 1000-Hz tone for 
sound-pressure levels of this stimulus up to ca 65 dB. This low level supports the 
view that the aural harmonic is insufficient to explain the monaural phase effects 
for octave complexes with comparable levels of the two partials (86, 137, 172). 

The origin of strong combination tones given by f1 - k(f2 - f1) remains a difficult
question. All psychophysical evidence points to the cochlea as the source of the
nonlinearity (75, 202). Since combination tones behave completely like tones 
presented to the ear, it seems inevitable that the nonlinear element is coupled back 
to the basilar membrane [but Wilson & Johnstone (273) were not able to observe 
corresponding basilar-membrane vibrations in guinea pig]. Crane (37) considered 
the mechanical impact between the hair cells and the tectorial membrane as a 
possible source of combination tones. Pfeiffer (158) and Smoorenburg (202) con- 
sidered a signal-compressing type of nonlinearity to account for both combination 
tones and suppression effects (to be discussed below). Further research may reveal 
how essential the effect is for understanding the cochlear processes. 

Returning to our pitch problem, some experiments by Smoorenburg (200) are 
highly relevant. He investigated the low pitch of two-tone complexes when, 
starting from the harmonic relation, both partials were shifted equidistantly 
(Δf = 200 Hz) in frequency (also studied by Sutton & Williams 221). For harmonics
beyond the 5th, the low pitch changed more than could be explained by the 
partials alone (either by temporal or spectral cues). Essentially this behavior was 
an extreme form of what had been described earlier as “the second effect of pitch 
shift” (196). Recent data by Patterson (148) on the low pitch of inharmonic tone 
complexes consisting of 6 or 12 partials, with a frequency spacing of 200 Hz, and 
experiments by Greenhough, Williams & Wright (74) on 3-tone complexes also
show this effect. Smoorenburg demonstrated that combination tones taking over
the role of lower partials absent in the stimulus, rather than the presented partials 
themselves, were responsible for the low pitch. This explanation is supported by 
experiments of van den Brink (242). His finding that “the second effect of pitch 
shift” disappears at very low sensation levels of inharmonic 3-tone stimuli becomes
clear by considering that the combination tones also disappear. All available
data on the pitch of inharmonic tone complexes are in accord with the
existence of a dominant frequency region as discussed above: the pitch extractor 
uses the available partials closest to this region, irrespective of whether these 
partials are present in the stimulus or created in the ear as combination tones. 


Dichotic Versus Monotic Presentation of Harmonics 


Perhaps the most significant recent study on the low pitch of complex sounds was 
carried out by Houtsma & Goldstein (101). Two-tone stimuli were presented 
either monotically (both harmonics to one ear) or dichotically (one harmonic to 
each ear). Subjects with extensive musical training and experience listened to pairs 
of these stimuli for which the low pitch (if heard) had equal a priori chances to 
jump according to 8 different musical intervals. In order to be sure that the subjects 
had used the low pitch of the stimulus as a whole rather than the individual pitches 
of the partials as a criterion for deciding which interval had been presented, the 
two harmonics of each stimulus were randomly chosen out of the three pairs (n - 1,
n), (n, n + 1), and (n + 1, n + 2). The most important finding was that at low 
sensation levels the subjects could recognize the musical intervals equally well for 
the monotic and dichotic conditions. In both cases, the percentage of correct 
responses increased for decreasing rank number of the harmonics. Apparently the
presence of the peripheral compound waveform for the monotic condition, or its
absence for the dichotic condition, is irrelevant for low pitch. The authors’ inevitable
conclusion was that the low pitch of the complex stimuli involved is
mediated by a central processor operating on neural signals derived from those
effective stimulus harmonics that are tonotopically resolved. Bilsen’s (12) finding 
that the level of a white noise required to mask the low pitch of two simultaneous 
harmonics was equal for monotic and dichotic presentations up to n = 8, with a 
maximum for n = 4, confirms the previous results. 
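
The stimulus design described in this section can be sketched in a few lines. This is only an illustration of the three candidate harmonic pairs; the fundamental of 200 Hz, the value n = 4, and the function names are assumptions and are not taken from Houtsma & Goldstein (101).

```python
import random

def harmonic_pairs(f0, n):
    """Return the three candidate two-tone stimuli (in Hz) for fundamental f0:
    harmonic ranks (n - 1, n), (n, n + 1), and (n + 1, n + 2)."""
    ranks = [(n - 1, n), (n, n + 1), (n + 1, n + 2)]
    return [(a * f0, b * f0) for a, b in ranks]

def draw_stimulus(f0, n, rng=random):
    """Pick one of the three pairs at random for a single presentation."""
    return rng.choice(harmonic_pairs(f0, n))

# Hypothetical example values.
pairs = harmonic_pairs(200, 4)
print(pairs)
```

Every pair shares the same fundamental, while the frequencies of the partials themselves differ from pair to pair, so the pitches of the individual partials are an unreliable cue to the interval.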


Relation between Low Pitch and the Pitches of the Harmonics 


The theoretical implications of the remarkably high sensitivity of the auditory 
system to frequency shifts of pure tones already have engaged many investigators. 
From measurements on the frequency-difference limen for a tone burst, per- 
formed over a wide range of frequencies and durations, Moore (125, 126) con- 
cluded that the data are consistent with the operation of a time-measuring 
mechanism for frequencies below 5 kHz, and with a spectral or place mechanism 
for frequencies above this. Experimental data for narrow bands of noise were in- 
terpreted similarly (127). Discrimination experiments by Feth (59) with combi- 
nations of amplitude and frequency differences did not support the hypothesis 
that both parameters are encoded by a single mechanism. Tsumura, Sone & Ni- 
mura (241) investigated the detectability of frequency transitions for tone pulses 
of various durations. Ronken (187) found that equal effective bandwidth does 
not make short tone bursts with different envelopes equally discriminable. The 
frequency discrimination of a brief tonal signal seems to be rather independent of 
the addition of an interfering tone (113, 188). Nabélek, Nabélek & Hirsh (134, 
135) performed extensive measurements on the pitch of tone bursts of chang- 
ing frequency. For a comparison of the pitch behavior of pure tones and the low 
pitch of three-tone complexes, both of short duration, see Metters & Williams 
(123). 

Kay & Matthews (107) published experimental evidence strongly suggesting 
that the human auditory pathways contain specific channels for frequency 
modulation. This highly interesting conclusion was drawn from experiments 
demonstrating that the ear’s sensitivity for detecting frequency modulation is 
selectively diminished after exposure to a conditioning tone more deeply fre- 
quency modulated at almost the same rate. 

The pitch of a pure tone does not depend exclusively upon its frequency but, to 
a small extent, also on other parameters such as a superimposed tone or noise 
(232). What are the consequences for the behavior of low pitch? Walliser (251) 
found that both as a function of sound-pressure level and of the level of super- 
imposed noise, the shift of the low pitch of a group of harmonics is correlated with 
the shift of a pure tone coinciding with the lowest harmonic, but not with a pure 
tone coinciding with the (absent) fundamental. In a subsequent article (252), he 
showed that the frequency-difference limen for a complex tone is determined by 
the harmonic with the smallest frequency-difference limen rather than by the 
fundamental. Terhardt (226) observed that the subjective octave interval for pure
tones, corresponding to a frequency ratio slightly larger than 1 : 2, can be related
to the upward shift of the pitch of a pure tone if a lower tone is added. Further- 
more, he found (228) that the low pitch of a series of harmonics is generally 
somewhat lower than the pitch of their fundamental. The subjective octave inter- 
val for complex tones could be predicted from these two effects (compare 220). 

Another related approach was applied by van den Brink. Some preliminary 
observations (242) with stimuli consisting of three harmonics had indicated that 
the binaural diplacusis (here defined as the Δf/f required to observe the same pitch
if two comparable sounds are presented alternately to the two ears) for the low 
pitch of these stimuli approximates the average diplacusis of the spectral com- 
ponents individually rather than of the (absent) fundamental. More extensive 
experiments (to be published in Acustica) confirmed this result for harmonics up 
to the 8th. In a subsequent experiment, van den Brink (also to be published in 
Acustica) included dichotic stimuli consisting of two out of the three harmonics 
presented to one ear, coinciding in frequency with the corresponding harmonics of 
the monotic stimulus, and one partial presented to the opposite ear. The deviation 
of the latter partial from its harmonic value, required for equal low pitches of the 
monotic and dichotic stimuli, agreed excellently with the binaural diplacusis for 
that frequency. These are only two out of a large number of experiments consis- 
tently demonstrating that the low pitch of a complex tone is tightly coupled to the 
individual pitches of the harmonics from which the low pitch is derived. It seems 
very difficult to reconcile the results presented in this paragraph with any theory in 
which the compound waveform of two or more harmonics is considered to carry 
any pitch information. 


Discussion 


In summarizing the experimental results presented above, we may conclude that 
there are five consistent and powerful arguments against the hypothesis that the 
low pitch of complex sounds is derived from the time intervals between peaks in 
the fine structure of unresolved harmonics: (a) the irrelevance of phase; (b) the 
dominance of low-order harmonics, accentuated by (c) the role of combination
tones; (d) the similar results for monotic and dichotic presentations; and (e) the 
correlation with the pitches of individual harmonics. On the contrary, the recent 
data present strong evidence in favor of the hypothesis that frequency resolution is
a prerequisite of hearing low pitch. De Boer’s (44) alternative, and neglected, 
hypothesis that low pitch is based on spectral cues should be accepted rather than 
the hypothesis that low pitch is based on temporal cues. 

This conclusion implies that peaks in the stimulation pattern along the cochlear 
partition, as a result of the frequency-analyzing capacity of the cochlea, are 
essential as a basis for low pitch extraction. This suggests that as a modification of 
the classic “place theory,” a central processor derives low pitch from the locations 
of the peaks of the stimulation pattern. Since for these peaks the nerve discharges 
are synchronized with the frequencies of the harmonics (up to ca 4000 Hz), we 
should not exclude the possibility that for the individual frequency components, 
temporal rather than spatial information is used as the neural code for their
frequencies. It will be obvious that in this concept time plays a quite different role
than in the criticized view that low pitch is based on the time intervals between 
peaks in the fine structure near successive crests of the compound waveform at 
some place along the cochlear partition. Available experimental data (e.g. 42, 126, 
127) do not give conclusive evidence for excluding either topical or temporal 
information as possible codes for the frequencies of the harmonics. Similarly, the 
observed relation between low pitch and the pitches of the harmonics is not 
decisive. This relation only implies that both pitch modes are based upon the same 
neural information. If the effect of secondary parameters on pitch and the phe- 
nomenon of binaural diplacusis are related to the locations of the peaks of the 
stimulation pattern along the cochlear partition, this would speak well for the 
tonotopical code. There is, however, no compelling reason to exclude the possi- 
bility that the pitch variations have their origin in an imperfect transformation of 
the temporal code at some postcochlear center of the auditory pathway. Addi- 
tionally, there is no reason to assume that the central pitch processor extracts low 
pitch from the pitches of the harmonics rather than that both pitch modes are 
extracted from the same neural information. The latter view can explain Moore’s 
(128) finding, referred to above, that the audibility of the pitches of the indi- 
vidual harmonics is not a necessary condition for hearing low pitch. 

Investigations on the production of a faint but distinct pitch when subjects are 
presented with stimuli consisting of a signal plus its time-delayed replica have in 
the past been interpreted as demonstrating that the time pattern of the nerve 
discharges does play a role in the sensation of low pitch. If these stimuli are 
presented monotically, the results can be interpreted satisfactorily in terms of 
spectral analysis. For instance, the superposition of white noise and its time- 
delayed replica, with a delay of τ seconds, results in a sound spectrum with periodic
peaks at n/τ Hz, explaining why a “repetition pitch” corresponding to
1/τ is observed. Continuing earlier experiments, Bilsen & Ritsma (15, 16, 177)
were able to show that the concept of a dominance region also holds for repetition 
pitch. More intriguing was the finding by Bilsen & Goldstein (13; see also 61) that 
under favorable conditions this pitch does not disappear if white noise is presented 
to one ear and the delayed version to the other. In this case, no spectral cues are 
available at the peripheral level. The authors conclude that “known binaural and 
monaural phenomena on pitch of complex sounds appear to be compatible with a
generalized place theory of pitch in which a central pitch mechanism reads across 
the characteristic frequency dimension of a centrally presented spectrum” in 
which interaural differences in timing are reflected. 
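
The spectral interpretation of the monotic case can be checked numerically. Adding a signal to a copy of itself delayed by τ multiplies its power spectrum by |1 + exp(-j2πfτ)|² = 2 + 2 cos(2πfτ), which is maximal at multiples of 1/τ. The sketch below evaluates that gain; the 5-msec delay and the function name are assumptions chosen for illustration.

```python
import math

def comb_power_gain(f, tau):
    """Power gain |1 + exp(-j*2*pi*f*tau)|**2 that results from adding a
    signal to a copy of itself delayed by tau seconds."""
    return 2.0 + 2.0 * math.cos(2.0 * math.pi * f * tau)

tau = 0.005  # hypothetical delay of 5 msec, so 1/tau is 200 Hz

peaks = [n / tau for n in range(1, 4)]       # multiples of 1/tau
dips = [(n + 0.5) / tau for n in range(3)]   # halfway between the peaks

for f in peaks + dips:
    print(round(f), round(comb_power_gain(f, tau), 6))
```

The gain reaches its maximum of 4 at each multiple of 1/τ and falls to zero midway between them, which gives the rippled spectrum referred to in the text.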

What central processes are involved in extracting low pitch remains an open 
question. No available physiological data can help us. The models proposed in 
recent years are, therefore, rather speculative. Whitfield (266) suggested that low 
pitch may be the result of some sort of pattern recognition. Walliser (251, 253; see 
also 227) explained his experiments by a model in which the pitch of complex 
sounds corresponds to that subharmonic of the lowest present harmonic which is 
nearest to the frequency of the envelope of the signal’s waveform. This model was 
modified and detailed by Terhardt (229-231), who considered low pitch to be
based upon the pitches of the harmonics as the product of a process of auditory
“Gestalt” perception in which a “learning process” is included. Wightman (269) 
presented a formalized model in which the pattern of the peripheral neural 
activity, reflecting the power spectrum of the stimulus, is assumed to be Fourier 
transformed into another pattern of which the positions of maximal activity 
determine the low pitch. Goldstein (71) proposed a statistical theory in which low
pitch is the result of the following successive steps: perfect spectral analysis, noisy 
projection of the spectral components, optimal matching to a harmonic pattern, 
estimate of the rank numbers of the harmonics and the fundamental. It seems 
clear that there is an urgent need for physiological data on the processes involved 
in the extraction of low pitch from a complex sound. 
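
To make one of these verbal models concrete, the sketch below gives a simplified reading of the rule attributed to Walliser: take the subharmonic of the lowest partial that lies nearest to the envelope frequency. It is not Walliser's published formulation. The function name, the example partials, and the assumption that the envelope frequency equals the spacing of the partials are all illustrative.

```python
def nearest_subharmonic_pitch(lowest_partial, envelope_freq):
    """Return (m, pitch), where pitch = lowest_partial / m is the subharmonic
    of the lowest partial that is closest to envelope_freq (both in Hz)."""
    m_guess = max(1, round(lowest_partial / envelope_freq))
    # The nearest subharmonic must be one of the neighbours of the ratio.
    candidates = [m for m in (m_guess - 1, m_guess, m_guess + 1) if m >= 1]
    m = min(candidates, key=lambda c: abs(lowest_partial / c - envelope_freq))
    return m, lowest_partial / m

# Hypothetical harmonic complex: 1800, 2000, 2200 Hz (spacing 200 Hz).
harmonic = nearest_subharmonic_pitch(1800, 200)

# The same complex with every partial shifted upward by 40 Hz.
shifted = nearest_subharmonic_pitch(1840, 200)

print(harmonic, shifted)
```

Under these assumptions the predicted pitch moves from 200 Hz to about 204.4 Hz when the partials are shifted, although the envelope frequency stays at 200 Hz.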


FURTHER MONAURAL FUNCTIONS 


Loudness 


Just as for the low pitch of complex tones, rather established views on loudness 
have been challenged in recent years. Warren (256) presented experimental 
evidence suggesting that the generally accepted sone scale reflects the influence of 
experimentally induced biases. Free from known biasing factors, judgments of 
half-loudness correspond to ca 6 dB attenuation for tones. [For the effect of 
procedure on loudness functions, see also Ward (254, 255) and Schneider et al 
(194), and for interindividual differences see de Barbenza et al (43).] A similar 
approach for speech under reverberation (258) seems to support the hypothesis 
that half-loudness judgments are equivalent to estimates of the effect of doubling 
distance from listener to sound source. If this is true, traditional procedures for 
calculating loudness (102, 149, 214, 215; see also 8) should be reconsidered. 
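
The size of the discrepancy can be illustrated numerically. The sketch assumes the conventional definition of the sone scale (1 sone at 40 phons, loudness doubling for every 10 phons) and an idealized free field in which level falls by 20 log10 of the distance ratio. Neither formula is spelled out in the text above, and the function names are hypothetical.

```python
import math

def sones_from_phons(level_phon):
    """Conventional sone scale: 1 sone at 40 phons, doubling every 10 phons."""
    return 2.0 ** ((level_phon - 40.0) / 10.0)

def level_drop_for_distance_ratio(ratio):
    """Level drop in dB when the distance to a point source grows by `ratio`
    (idealized free field, no reverberation)."""
    return 20.0 * math.log10(ratio)

# On the sone scale, half loudness needs a 10-dB reduction.
half_loudness_ratio = sones_from_phons(60.0) / sones_from_phons(70.0)

# Doubling the distance in a free field lowers the level by about 6 dB.
doubling_distance_db = level_drop_for_distance_ratio(2.0)

print(half_loudness_ratio, round(doubling_distance_db, 2))
```

The 6-dB figure for doubled distance matches the attenuation reported for bias-free half-loudness judgments, whereas the sone scale places half loudness at 10 dB.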

In addition, the use of simultaneous dichotic loudness balance procedures (as in 
263 and 280) for investigating per-stimulatory loudness adaptation has been 
criticized by Fraser, Petty & Elliott (62, 157). They showed that the adaptation 
effect disappears if binaural interactions are reduced or eliminated. Further in- 
vestigations (216-219; but see also 24, 92, 130, 271) supported this conclusion. 

Other investigators studied the effect of various parameters on loudness. Scharf 
& Fishken (193) found that the ratio of binaural to monaural loudness for tone 
and noise is lower than the traditionally accepted value of 2.0. Levelt, Riemersma 
& Bunt (115) concluded from their experiments that binaural additivity of loud- 
ness holds for nonzero stimulation of the ears. Contralateral noise reduces the 
loudness of a 1000-Hz tone at low levels, but increases the loudness at moderate 
levels (190). The apparent loudness of a tone pip can be increased by 15 dB or 
more if it is preceded by a tone burst to the contralateral ear (63). Boone (22) 
provided evidence that the time constant governing the build-up of loudness of 
periodic tone bursts is of the order of 100 msec. The monaural loudness level of a 
250-msec tone burst is only 1 phon below the level for durations of 2 sec or longer 
(57), whereas a fast onset seems to contribute to the loudness (69). This onset effect 
holds also for periodic acoustic transients (32; see also 33). Pure-tone equal-
loudness contours appear to depend upon the frequency of the standard tone
(124). In another study (194), these contours were derived from magnitude esti- 
mates. The most comfortable loudness levels of various sounds, extensively ex- 
plored as a function of several parameters (243; see also 245), appear to scatter 
around 50 dB sound-pressure level, but are significantly lower for ascending than 
for descending tones (275). Stephens & Anderson (210, 213) found values around 
100 dB SPL for the uncomfortable loudness level of a 1000-Hz tone (further 
loudness studies include 34, 103, 108, 136). 


Timbre of Steady-State Complex Sounds 


Von Bismarck (248-250) studied the timbre dissimilarities of various complex 
stimuli by means of semantic scales. Factor analysis applied on the scores of 35 
sounds along 30 different scales resulted in “dull-bright” as the main factor (249). 
Further experiments showed that brightness (or sharpness) is primarily deter- 
mined by the absolute frequency region of the loudest spectral components (250). 
Plomp & Steeneken (163) investigated the extent to which characteristic timbres 
are blurred by reverberation in a large hall. 


The “Continuity Effect” and Lateral Suppression 


As early as 1957, Thurlow (233) reported that under certain conditions the less 
intense component of a pair of alternating tones is heard as sounding contin- 
uously. This “continuity effect” appears to represent a general property of the 
hearing system; if an extraneous sound such as a cough or a tone burst completely
replaces a speech sound in a recorded sentence, listeners believe they hear the 
missing sound (11, 257, 259, 261). The maximum sound-pressure level at which the 
continuity effect still holds resembles, as a function of frequency, the masked- 
threshold curve (52, 260), depending also slightly on the angular separation of the 
two tone sources (52). All experimental evidence supports the view (97-99) that 
the continuity (or pulsation) threshold of a certain sound stimulus, measured as a 
function of frequency, reflects the neural activity pattern induced by that sound 
along the cochlear partition. 

Houtgast (97-100) adopted this effect as a psychophysical procedure for inves- 
tigating lateral suppression (or inhibition) in hearing, already known neurophys- 
iologically. He demonstrated (97, 99) that a noise with a steep positive or negative 
slope at a particular frequency shows significant edge effects, comparable to Mach 
bands in vision, if the noise and the probe tone are presented nonsimultaneously. 
This effect might be essential in explaining the observed pitches corresponding to 
the edges of a noise band (58, 70), as well as the phenomenon known as Zwicker’s 
“negative afterimage” (117). The absence of lateral suppression in simultaneous 
masking (confirmed by Rainbolt & Small 173) can be understood by assuming that 
for that condition both the masker and the probe tone are subjected to the same 
suppression mechanism. Houtgast (98, 99) showed that a strong tone may reduce 
the level of a weaker tone of ca 20% lower frequency by more than 15 dB. The 
extent to which a tone is suppressed by simultaneous wide-band noise (99, 100) 
agrees quantitatively with the loudness reduction of a tone when noise is added.
Hellman’s (94) finding that the loudness of a 1000-Hz tone tends to grow more
rapidly in the presence of a narrow-band noise than of a wide-band noise can be 
explained by lateral suppression. Experiments with rippled-spectrum noise indi- 
cated (99) that the effective bandwidth of the ear’s frequency-analyzing mechan- 
ism in nonsimultaneous masking is about half the critical bandwidth as found in 
simultaneous masking, which demonstrates quantitatively the sharpening or en- 
hancement effect of the lateral suppression mechanism. 


Origin of Tonal Consonance and Dissonance 


The perceptual singularity of musical intervals consisting of two simultaneous
tones with a simple frequency ratio was studied by means of identification tests of 
briefly presented chords (164). The results showed clearly that the chords were 
confused in terms of their width rather than their frequency ratio simplicity. This 
confirms the view that tonal consonance is related to interval width for pure tones. 
Chords consisting of tones of which the frequencies of the harmonics were com- 
pressed gave sensations consistent with Helmholtz’s criterion for dissonance (198). 


Detection of Sounds 


Under this heading, I will report on a number of investigations in which the 
detection of a stimulus, whether or not superimposed on another sound, is 
involved. 

Watson, Franks & Hood (262) found the sound-pressure level of 150-msec tone 
bursts required for 76% correct detection in a two-alternative forced-choice 
procedure to be in good agreement with the ISO audiometric-zero standard. On 
the basis of additional data, extension of this standard’s present limit of 8,000 Hz 
up to 18,000 Hz was proposed (89, 140). The dependence of hearing threshold on 
the time the subject has been in quiet seems to be negligible (209). Anderson & 
Whittle (4; see also 25, 132, 205) provided evidence that the old “missing 6 dB” 
effect (discrepancy in threshold between free-field and earphone listening condi- 
tions) is caused by the physiological noise produced under a supra-aural earcap. 
Further evidence showed that under quite different conditions the ear’s sensitivity 
is slightly enhanced by a preceding or succeeding stimulus (129, 138, 139). The 
interaction of the auditory and visual systems in signal detection was studied by 
Fidell (60). Karlovich et al (106) verified that the absence of temporal integration 
when the duration of a 125-Hz tone burst is varied between 200 and 5 msec is due 
to the spread of energy toward the higher, more sensitive frequency region. If this 
spread was checked by means of a low-pass filter, threshold behavior agreed with 
the integration curves as studied by many investigators (66, 150, 211, 212, 225). 
The role of energy spread if a tone burst is superimposed on a tone of the same 
frequency was demonstrated by Leshowitz & Wightman (114). 

Many studies were devoted to the masked threshold of a brief probe sound as a 
function of various parameters (for recent reviews see 51, 54, 192, 286). Mulligan 
& Elrod (133) studied the detection of tones in noise in terms of signal-detection
theory, whereas Reed & Bilger (174) considered the masked thresholds in relation
to the various estimates of the critical bandwidth. Ahumada & Lovell (3) concluded
that the detection of a tone burst in noise is determined by more than one
feature. The deletion of a tone appears to be much harder to detect than the 
insertion of the same tone (197). Canahl (31) observed that four equally spaced 
tones mask a central (1000 Hz) signal much more than would be predicted from 
two tones. In comparing two masking tones with one, Chocholle (35) found a
similar effect. Patterson (145) noted that the amount of masking is related to the 
short-term spectrum of brief maskers. Even for narrow noise bands, the noise 
masks a pure tone more effectively than the reverse (95). Leshowitz & Cudahy 
(112) investigated the amount of masking of a 5-msec probe tone at various delays 
after the onset of a 500-msec tonal masker of constant frequency. Masking ap- 
peared to be largest at the moments when the gradients of the maskers were 
steepest. Analogous to critical bandwidth, a critical masking interval of 10 to 14 
msec has been proposed (151; see also 189). Zwicker & Schütte (285) studied, also
by means of a probe signal shifted in time, the extent to which the ear is able to 
follow the random temporal changes of the sound level of narrow bands of noise 
and found that this ability is limited to bandwidths below ca 100 Hz. 

The threshold for clicks in simultaneous masking as well as in forward and 
backward masking of 500-msec noise bursts was investigated by Wilson & Carhart 
(274). Presenting the click between two bursts resulted in threshold shifts greater 
than either forward or backward masking would have produced if operating 
alone. This condition is comparable to that studied by Duifhuis (48), who measured the threshold
for brief probe tones presented with variable time delay with respect to the pulses 
of a pulse train. The nonadditivity of forward and backward masking was also 
observed by Patterson (144). Robinson & Pollack (180; see also 169) interpreted 
this effect in terms of a temporal integrating period. For noise bursts shorter than 
200 msec, the amount of forward masking appears to depend primarily upon the 
product of duration and intensity of the masker (282). The effect of forward and 
backward masking on speech intelligibility was studied by Dirks & Bower (45). 
Zwislocki (287) published a quantitative psychophysiological theory of central 
masking which appeared to predict accurately single-unit characteristics of the 
auditory nerve and of primary-like populations in the cochlear nuclei and the 
superior olive. 

Since experiments on intensity discrimination can be interpreted as detection 
experiments, some are included here. Viemeister (244) concluded that Weber’s 
law holds also for a pure tone of 950 Hz if frequency regions at and above the 
second harmonic of the signal are masked by filtered noise. The generality of this 
conclusion was criticized by Schacknow & Raab (191). Intensity discrimination 
data on broad-band and high-pass filtered clicks were also consistent with Weber’s 
law (152). Rochester (184) found that although the processes of the detection and 
of the discrimination of a change in duration seem to be very similar for weak 
signals presented in a background of continuous noise, corresponding experi- 
mental procedures gave different results. Luce & Green (116) presented a psy- 
chophysical theory which could account for Weber’s law and other auditory 
functions. Using sinusoidally modulated noise, Rodenburg (185) studied the 
just-noticeable modulation depth of noise as a function of modulation frequency 
and found that the resulting temporal transfer function has a cutoff frequency of
40-80 Hz and an asymptotic slope of 6 dB per octave for higher modulation
frequencies. Finally, I refer to three studies on the intensity perception of tone 
pulses (10, 23, 171). 
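
A cutoff frequency combined with an asymptotic slope of 6 dB per octave is what a first-order low-pass filter produces. The sketch below uses that filter purely as an illustration of such a transfer function. The choice of filter, the 60-Hz cutoff (taken from within the reported 40-80 Hz range), and the function name are assumptions, not Rodenburg's (185) model.

```python
import math

def first_order_lowpass_db(f_mod, f_cutoff):
    """Gain in dB of a first-order low-pass filter at modulation frequency f_mod."""
    return -10.0 * math.log10(1.0 + (f_mod / f_cutoff) ** 2)

f_cutoff = 60.0  # hypothetical value within the reported 40-80 Hz range

gain_at_cutoff = first_order_lowpass_db(f_cutoff, f_cutoff)

# Slope over one octave, well above the cutoff frequency.
octave_slope = (first_order_lowpass_db(32 * f_cutoff, f_cutoff)
                - first_order_lowpass_db(16 * f_cutoff, f_cutoff))

print(round(gain_at_cutoff, 2), round(octave_slope, 2))
```

The gain is about 3 dB down at the cutoff and approaches a fall of 6 dB per octave at higher modulation frequencies.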


Temporal Acuity 


A number of papers have been published recently dealing with the interesting 
question of the shortest time interval within which the ear can discriminate the 
order of auditory events. Patterson & Green (146, 147; see also 72) presented 
subjects with pairs of very brief transient signals having identical energy spectra 
but different phase spectra. The results of these experiments suggested that the ear 
can discriminate differences in temporal order as small as 2.5 msec. Further 
research (73) indicated that this value is largely independent of frequency. Babkoff 
& Sutton (5; see also 186) used pairs of clicks of unequal intensity. Stimuli with 
different order of the two clicks could be discriminated if the interpulse interval 
exceeded ca 2 msec; the score of correct responses had a second minimum for
intervals of 12-15 msec, whereas the perception of temporal order itself is a 
dominant cue only for longer intervals. The minimum detectable gap between 
successive tonal pulses, being as small as 2.5 msec for 3-msec pulses, appears to 
increase as a direct function of both the pulse duration and the frequency disparity 
between successive pulses (272). (Further related studies include 85, 111, 166-168, 
179.) 


Subjective Duration of Sounds 


Zwicker (281) found that subjects are able to adjust the duration of a sound burst, 
a silent gap, or a time interval between two short pulses with an accuracy of ca 10% 
to the duration of a standard. For equal physical durations, the sensation of a gap 
or a time interval seems to be about half as long as that of a burst. These experiments
were continued by Burghardt (26-29). Below 400 msec, asymmetries between the 
subjective durations of a burst and a gap up to a factor of 5 were observed (29). 
This effect disappeared for durations above 1 sec. Below ca 3000 Hz, a short
high-frequency sound burst has a substantially longer subjective duration than a 
low-frequency burst of the same physical duration; this trend is reversed above 
that frequency (28). Related investigations were carried out by Abel (1, 2). For 
durations above 50 msec, the Weber fraction for the discrimination of the duration 
of sound bursts is ca 0.1, in agreement with Zwicker’s (281) results, whereas the 
discrimination for shorter durations is more in accord with a square-root relation 
(1). The Weber fraction for the discrimination of temporal gaps showed a local 
minimum at ca 2.5 msec and a maximum at 10 msec (2). 


BINAURAL FUNCTIONS 


Reviews on research on binaural hearing up to about 1966 by A. W. Mills, L. A. 
Jeffress, N. I. Durlach, and J. V. Tobias can be found in Volume I of Foundations 
of Modern Auditory Theory (237). A very extensive review (522 references) on 
spatial localization of sounds, including animals, was given by Erulkar (56).


Localization of Sounds


Harris (88) repeated and extended classic experiments on the minimum audible 
angles for tone sources in a free-field situation. Low values of 1°-2° over the 
frequency range from 125 through 6400 Hz were obtained. The data suggest that 
eliminating either the interaural time cue or the interaural intensity cue has only 
slight effects on the minimum audible angles (compare 91). When the two cues 
were rendered noncongruent, the difference in arrival time appeared to be pre- 
potent over a difference of intensity up to ca 7 dB. In a separate paper (90), results 
for monaural and binaural localization were compared (see also 53). Stimulus 
durations of more than ca 2 sec are required to make good use of head movements 
for localization (235). For sources located in the median plane, localization is 
influenced primarily by the irregularities of the pinna (64). Toole (238) concluded 
that small head movements are not the main factor underlying the “externaliza- 
tion” of sounds. 

Several of these questions have been studied extensively in some German 
laboratories in their search for stereophonic sound reproduction systems with a 
high degree of naturalness. An extensive review can be found in Blauert (20a). The 
same author (17) investigated the role of the outer ear in sound localization in the 
median plane. For time delays exceeding ca 0.5 msec at the ears, the precedence 
effect seems to be valid also in this plane (19; but see also 109). For high-frequency 
signals, even vertical shifts of sound sources corresponding to less than 4° were 
discriminated (264). It has been demonstrated (39, 41; see also 38, 40) that the 
acoustical crosstalk resulting from sound diffraction at the head when stereo- 
phonic signals are reproduced with two loudspeakers can be compensated for by 
special filters. Listening tests showed that in this way, even with two loudspeakers, 
nearly ideal localization for the entire horizontal plane including front-back dis- 
crimination can be obtained. The typical disadvantage of headphones for stereo- 
phonic listening, namely that the sound images are localized in the head (see 159 
for a theory of this effect), has been conquered successfully by means of electrical 
circuits simulating loudspeaker-listening conditions (110, with 77 references; 21; 
see also 265). 


Lateralization of Sounds 


The fact that with headphones sounds are lateralized rather than localized is no 
problem for those investigators studying the lateralization of binaural signals. 
Yost, Wightman & Green (279) presented the ears of subjects with identical 
band-pass filtered clicks and found that the discrimination of lateral position of 
the sound image, induced by delays between the times of arrival at the ears, 
depends primarily on the low-frequency contents of the clicks. This holds also for 
localization (7). Using antiphasic clicks, Bilsen & Raatgever (14) were able to show 
that lateralization is derived from a dominant frequency region around ca 650 Hz. 
It is of interest that the same frequency region dominates dichotic repetition pitch 
(13). Thurlow & Jack (234) presented evidence that the lateralization of a noise 
stimulus is affected by a preceding adapting noise stimulus with a time-delay 
difference at the two ears. Perrott & co-workers (153-156) studied the limits of 
binaural fusion as a function of interaural frequency and signal duration. Pollack 
(165) studied the interaural correlation-discrimination threshold for pulse trains, 
and Dunn (50) investigated the effect of unilateral masking on the lateralization of 
binaural pulses. 

Lateralization of a sound image can be achieved by introducing a time 
difference or an intensity difference between the signals presented to the ears. 
Traditionally the interaction of time and intensity has been studied in terms of a 
binaural “trading ratio” as if the effect of one of these parameters can be com- 
pensated for by the effect of the other. Recent research has brought forward 
further evidence that this view is not correct. Blauert (18, 20) found that oscilla- 
tions of a sound image between the ears are detectable up to a slightly but 
significantly higher rate if the oscillations are based on interaural intensity differ- 
ences rather than on interaural time differences. Hafter & Carrier (81) requested 
subjects to detect the difference between low-frequency tonal signals, leading in 
time to one ear but more intense to the other, and signals without any difference at 
the ears. The results confirmed the view of an incomplete compensation and were 
used by the authors for a model of time-intensity trading (see also 6, 46, 68, 82). 


The Effect of Masking-Level Difference (MLD) 


In other approaches, the effect of MLD, usually defined as the improvement in the 
binaural masked threshold when the polarity of either the masker or the signal to 
one ear is reversed, was included. In an earlier experiment, Hafter & Carrier (80) 
had studied MLD for a tonal masker and a tonal signal of the same frequency and 
duration. Introduction of interaural time and interaural intensity differences of the 
signal suggested that the trading ratio depends upon experimental condition. Yost 
(276) observed that at high frequencies the interaural phase difference is no longer 
usable as a cue. Contrary to the homophasic condition, the Weber fraction de- 
creases for the antiphasic condition for increasing values of the phase angle of 
addition (277). (For a discussion on possible artifacts in this type of experiment see 
267.) Similar experiments, but with both the masker and the signal derived from 
the same 50-Hz band of noise, supported the view that the time and intensity cues 
do not cancel perfectly (105, 118, 119). McFadden, Russell & Pulliam (121) found 
that in binaural listening the masking pattern of an antiphasic 100-msec tone 
signal with a homophasic 4000-Hz continuous masker is characterized by two local 
minima at about ±10 Hz from the masker. Further related experiments are found 
elsewhere (30, 122, 143, 195, 207; see for a survey 104). 

In the more traditional MLD experiments, wide-band noise is used as a masker. 
MLD appears to be independent of signal duration and, probably due to physio- 
logical noise, is reduced at low sensation levels (204). At hearing threshold, the 
effect is very small (67). McFadden & Pulliam (120) observed that a monaurally 
presented signal in a binaurally presented noise masker can be detected at a lower 
level than is needed to lateralize it. Wilbanks (270) varied the interaural 
correlation of both his 135-Hz wide noise signal, centered at 250 Hz, and the 
masker, and noticed that with a correlated masker detection is about 14 dB better 
with an uncorrelated signal than with a correlated signal (see also 206). Robinson 
& Dolan (182) confirmed that for uncorrelated noise, MLD for tones decreases for 
increasing signal frequency. For correlated noise, the detection threshold of a 
500-Hz tone in the antiphasic condition is lower for large than for small time 
intervals between the onsets of the noise and the signal (183; see also 9, 240). 
Townsend & Goldstein (239) investigated MLD at suprathreshold levels by means 
of loudness matching. It appeared that the antiphasic advantage is greatly reduced 
or absent for sensation levels above 20 dB. It is likely that loudness determines the 
frequency discrimination performance for binaural signals (65). Using complex 
sounds (natural and synthetic vowels) as signals, Mosko & House (131) confirmed 
that maximal MLD occurs when the components in the region of 250 to 500 Hz are 
interaurally out of phase. At low sensation levels, resolution of frequency and 
amplitude is better with antiphasic than with homophasic signals (96). (Further 
studies include 181, 222-224, 278.) 

This large number of recent MLD studies illustrates how fashionable this topic 
is. Perhaps the most interesting finding is that MLD exists also in forward and 
backward masking (170, 199). This means that the detection of a low-frequency 
tone, separated from the masker by a short silent interval, improves if the phase is 
reversed at one ear. This may have important implications for models related to 
MLD, such as the lateralization model proposed by Hafter (79) and the correlation 
model proposed by Osman (141, 142). 

Space limitation in the Annual Review of Psychology prevents serious in-depth 
treatment of a topic to a degree that would satisfy the expert. Nor does it allow for 
background information so necessary for clear exposition for the student. The 
voluminous material that emerges during the several years between particular 
reviews can only be skimmed, thereby neglecting and possibly offending a number 
of deserving researchers. This is particularly true with the present review. It will 
attempt to cover studies using two widely accepted techniques, the Average 
Evoked Response and the Contingent Negative Variation that have provided 
volumes of research over the past decade. Neither of these has been systematically 
reviewed in the Annual Review. Research utilizing these techniques has literally 
metastasized to include practically every aspect of behavior, species ranging from 
rat to man, and clinical applications from dentistry to psychiatry. To review this 
vast area in the limited number of pages allotted is presumptuous. But this I do 
hoping that in broad outline most of the major areas can be covered, and this I do 
with an apology to many investigators and friends whose interesting work is 
deserving of more description and elaboration than space will permit. 

The following general references (33, 72, 73, 104, 127, 201, 203, 218, 233, 265, 
291, 292, 318) provide invaluable background. Four deserve special comment. 
Regan’s (233), the most complete, covers all aspects of most techniques; Shagass’ 
(265) is especially valuable to psychologists and psychiatrists; Perry & Childers’ 
(218) covers the visual response; McCallum & Knott’s (203) is based on reviews 
and papers at the 2nd International CNV Congress. All are scholarly and infor- 
mative. 


THE EVOKED RESPONSE (ER) AND THE CONTINGENT 
NEGATIVE VARIATION (CNV) 


The growing popularity of evoked response research has been paralleled by 
technological advances in the development of small and inexpensive computers 
which enable investigators to sum and average evoked responses rapidly and 
efficiently. Also contributing has been the repeatedly reinforced expectation that 
the evoked response (ER) technique will be more useful in assessing brain func- 
tion than the conventional EEG and other electrophysiological analyses. In con- 
trast to the EEG, ERs are under stimulus control, thus providing additional 
information about the different sensory systems, functionally different pathways, 
and brain areas involved in the response. ERs lend themselves to more quantifi- 
able analysis than does the EEG trace. 
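
The summing and averaging mentioned above can be illustrated with a minimal simulation. It assumes that the response is identical on every trial and time-locked to the stimulus, and that the background activity is independent random noise; the waveform, the noise level, and all function names are hypothetical. Under those assumptions the residual noise in the average falls roughly as the square root of the number of trials.

```python
import math
import random


def template(n_samples):
    """Hypothetical evoked waveform: one damped oscillation (arbitrary units)."""
    return [5.0 * math.exp(-t / 60.0) * math.sin(2 * math.pi * t / 80.0)
            for t in range(n_samples)]


def simulate_trial(signal, noise_sd, rng):
    """One stimulus-locked epoch: the fixed response plus random background."""
    return [s + rng.gauss(0.0, noise_sd) for s in signal]


def average_trials(trials):
    """Point-by-point mean across epochs aligned to stimulus onset."""
    n = len(trials)
    return [sum(column) / n for column in zip(*trials)]


def rms_error(estimate, signal):
    """Root-mean-square difference between an average and the true response."""
    squared = [(e - s) ** 2 for e, s in zip(estimate, signal)]
    return math.sqrt(sum(squared) / len(signal))


rng = random.Random(0)          # fixed seed so the run is repeatable
signal = template(200)
errors = {}
for n_trials in (4, 400):
    trials = [simulate_trial(signal, 10.0, rng) for _ in range(n_trials)]
    errors[n_trials] = rms_error(average_trials(trials), signal)
    print(n_trials, round(errors[n_trials], 2))
```

With a hundredfold increase in the number of trials, the residual error should drop by about a factor of ten. Real responses vary from trial to trial, so this is an idealization.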

The computerized ER that emerges from the brain’s background electrical 
turbulence is a complex pattern of as many as 15 identifiable wave components or 
reproducible patterns of polarity, amplitude, and duration that lasts for hundreds 
of milliseconds. Each component can be accurately measured in amplitude and 
latency. Characteristics of these electrical cerebral patterns indicate that they 
represent activity initiated by sense organs, cortical and subcortical brain areas, 
and anatomically and functionally different neuronal aggregates. These patterns 
may also indicate a relationship between the evoked neuronal activity and the 
subject’s perception of the evoking stimulus and its significance to him at the time 
of perception. Finally, the ER faithfully reflects the nature of the evoking stimu- 
lus; responses to light, sound, and touch all differ, as do ERs to patterned and 
colored light, or auditory stimuli when they convey meaning. 

The Contingent Negative Variation (CNV), a slow-rising, surface-negative 
brain potential, usually depends upon the association or contingency of two 
successive stimuli. The conditions that produce the CNV are so similar to Pavlovian 
conditioning that the technique gained immediate and widespread acceptance by 
psychologists. 


Reliability, Stability, Validity: Methods and Techniques: 
Statistical and Mathematical Analyses 


Reliability and stability of ERs were early concerns of evoked response investiga- 
tors as responses were reported to be affected by a variety of conditions. However, 
most studies indicated that the ER was reliable in the adult (71, 109, 182, 267). In 
some, mean product-moment correlations of 0.88 were obtained between visually 
evoked responses (VERs) separated by months (109). Recent studies have 
involved infants and children and report that reliability and stability increase with 
age (53, 119-121). Callaway & Halliday (53), recording from 120 children 6 to 15 
years of age, reported that variability of the visual (VER) and auditory (AER) 
evoked response decreased with increasing age. Ellingston et al (120, 121), study- 
ing variability of the VER in the human infant and adult, found that the adult 
response was far more reliable than that of the infant. Despite the impressive 
correlations, a procedure often adopted is to include a reliability measure in each 
study itself. 
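
A reliability measure of the kind described, the product-moment correlation between two averaged responses, can be sketched as follows. The two waveforms are invented values sampled at the same latencies and are not data from any of the cited studies; the function name is hypothetical.

```python
import math


def product_moment_r(x, y):
    """Pearson product-moment correlation between two equal-length waveforms."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("waveforms must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a flat waveform")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical averaged VERs (microvolts) from two sessions, same latencies.
session_1 = [0.0, 2.1, 4.8, 1.2, -3.5, -1.0, 2.2, 0.4]
session_2 = [0.3, 1.8, 5.1, 0.9, -2.9, -1.4, 2.6, 0.1]
print(round(product_moment_r(session_1, session_2), 2))
```

Because the coefficient ignores overall gain and baseline offset, two responses with the same shape but different amplitudes still correlate perfectly; amplitude stability has to be checked separately.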

Validity of the ER as representative of neural activity was a source of concern 
and argument. Are components of the ER of neural origin or are they the sum of 
artifacts, myogenic responses, extracerebral activity, or all of these? These possi- 
bilities were seriously investigated by Bickford and others (34-36, 104). Bickford’s 
criticisms were worrisome. With enthusiastic admonishment he reported data 
showing that with different postures, such as a change in facial expression, 
frowning, raising the eyebrows, muscle tension, etc, new waves emerged, others 
changed, still others disappeared. 

The only answer to Bickford’s criticism was to show a relationship between the 
gross wave components of the ER recorded at scalp and those recorded at cortex. 
A number of experimenters succeeded in doing just that (44, 47, 75, 95, 96, 140, 159, 
171, 177, 305). There was good concordance between the response recorded from 
scalp and that from dura, although the amplitudes of the scalp recordings were 
often smaller. Differences that did occur among the various studies could be 
accounted for by the methods used. For example, some studies (96, 140) analyzed 
only the first 125 msec of the response as contrasted to others (159) which de- 
scribed both “early” and “late” ER components separately. There remained the 
question of the differences in the wave form of the ER when electrodes were 
placed over and under the dura. Geisler & Gerstein (139) demonstrated that the 
major part of the diffusion or spreading of the potential distribution may actually 
be due to dura and cerebro-spinal fluid under the dura. Potentials recorded from 
dura and scalp tended to resemble each other more than those recorded subdur- 
ally. 

Methods and techniques received systematic attention not only in papers but in 
a number of symposia and books. The NASA symposium (104) dealt specifically 
with these problems: methods involved in cross-modality comparison of ERs 
(Goff 144); methods in the CNV (Cohen 73); data analysis techniques in 
ER research (Donchin 100); and the topic of psychological variables in the ER re- 
search (Sutton 288). The nature of the evoking stimuli—visual, auditory, etc—was 
found to be critical to ER research and received systematic attention in a number 
of studies. Visual stimuli were reviewed and evaluated in detail by Perry & 
Childers (218, Chapter 2) and Regan (233, Chapters 2 and 5) and included studies 
in which color, pattern, frequency, intensity, steady state, binocular, and moving 
stimuli were utilized. Similarly treated were somatosensory stimuli by Shagass 
(265) and auditory by Davis (86, 87, 89, 90) and Rapin & Graziani (224, 225). 

Statistical and mathematical analyses of bioelectrical data were the topic of a 
number of studies directed towards quantifying and comparing ERs (32, 53, 
98-100, 123, 175, 233, 286, 287, 299, 308-310). The multidimensionality of the ER 
made it difficult to state with any certainty how two or more responses were 
different. They may have identical wave shapes but different amplitudes; some 
segments may differ while others are identical, or one or more components may 
differ. Multivariate analysis (98-100) circumvented these problems and provided a 
reliable technique for judging the significance of differences. Factor analysis of 
principal components of the ER (175) was found to provide a meaningful, quan- 
titative description when the resulting factor components were rotated following a 
Varimax procedure (175). Promise for this technique was found in results from 
drug studies wherein ERs computed under different doses of the same drug were 
described by similar regression equations, while markedly different equations 
were needed to describe effects for different drugs. 


Neurogenesis, the Underlying Neuroanatomy 
and Neurophysiology of the ER 


A detailed knowledge of the intracranial sources and neural pathways that deter- 
mine the wave form of the ER will provide a powerful technique for both re- 
searchers and diagnosticians. When such knowledge is definitive, the ER may be 
used to identify with reasonable certainty the areas of the brain most affected by 
psychological processes or by disease or injury. We may even know what systems 
or brain areas participate in certain mental processes. There have been a number 
of approaches to this problem. 

One has been the individual neuron or single cell model wherein an explanation 
of the configuration of the ER was sought in the sum of time-locked neuronal 
activities of many cells. Advocates of this hypothesis, Creutzfeldt & others (80-84, 
133), proposed that averaged single neuron activity was responsible for all the 
different components of the surface ER. From purely visual inspection of the 
intracellular frequency and the wave form of the surface ER there appeared to be 
support for this notion since the frequency of intracellular activity and gross 
surface waves were at least superficially similar. Also the frequencies of both 
changed in a similar manner during sleep and wakefulness (81, 83, 133). With 
careful scrutiny and examination of longer sections of the record, the relationships 
between intracellular activity and ER waves were found to be fleeting (124). As 
different laboratories have reported different relationships between unit firing and 
local surface slow wave activity (80, 81, 94, 124, 133), evidence is accumulating that 
there is really no fixed relation between unit firing frequency and local slow wave 
ER activity. The phenomenon may be an ignis fatuus. 

Anatomical pathways have been another approach to tracing the source of 
components of the ER. A long-standing and convenient hypothesis regarding 
these sources was that the ER could be divided into two main temporal sequences 
known as the primary or early response and postprimary or late response. These 
were believed to represent two anatomically different systems. Identifying features 
of the earlier primary response were that conduction was via specific thalamo- 
cortical pathways to specific primary sensory areas of the cortex (38) and that it 
remained unchanged or in some cases enhanced under anesthesia and during 
sleep. The postprimary response, arriving 50-100 msec after the stimulus, was 
thought to result from neural volleys traveling through extralemniscal pathways 
and hence to involve the reticular formation and unspecific thalamic nuclei in their 
conduction. The postprimary response was characterized as being polysensory in 
nature, widely and bilaterally distributed over the cortex, and markedly sensitive 
to changes in levels of consciousness, as with attention, anesthesia, and sleep. 
Because of these features the early response was regarded by many to be “sensory 
receiving” in function, as contrasted to the presumed function of “information 
processing” assigned to the late components. 

While there remains general agreement about the neural systems and pathways 
that give rise to the early components, the source of the late response has been 
seriously challenged (142, 304, 307, 324). From careful isopotential mapping 
based on the notion that intracranial sources were reflected in scalp recording 
according to the principles of volume conduction, Vaughn (304) reported that all 
components of the human auditory evoked response, including the long latency 
vertex potentials, derived from a dipole source reversing polarity at the level of the 
Sylvian fissure. In a similar manner, the somatosensory evoked response was 
generated solely in the primary somatosensory projection area. This assertion, if 
taken at face value, was in conflict with long established beliefs concerning the 
localization and presumptive origin of the late or long latency components, par- 
ticularly the vertex wave. 

These findings were challenged by Kooi, Tipton & Marshall (183), who re- 
peated isopotential mapping but did not use, as Vaughn did, a cephalic reference. 
They found no evidence of a polarity reversal. Others (221) have confirmed Kooi’s 
observations. The existence of a deeply located dipole in the primary auditory 
cortex as the source of all components of ER is thus questionable. Others 
attempting to calculate the potential distribution on the scalp produced by hypo- 
thetical sources within the skull and to relate these to the neuroanatomy under- 
lying the evoked response (144, 183, 185) have met with a number of problems. 
Some even express doubt that it is reasonable to regard the brain as an isotropic 
volume conductor (194, 209). 

Other evidence against the “extralemniscal hypothesis” was more convincing. 
The suggestion that the late longer latency components of the ER derive from 
nonspecific impulses transmitted along extralemniscal bilaterally projecting retic- 
ular and thalamic pathways was refuted by a number of experiments (142, 184, 
187, 284, 324). These studies involved patients with unilateral cerebral lesions 
which disrupted primary somatic pathways at or above the thalamic level. The 
“extralemniscal hypothesis” predicts that stimulation of the affected side of the 
body should produce ERs over the unaffected hemisphere. That is, the 
extralemniscal conduction would escape or bypass the point of injury or lesions. A 
startling result, however, was the bilateral loss of all evoked response components, 
including those over the unaffected hemisphere, with stimulation of the affected 
side of the body. 

Despite these findings, data continue to accumulate which indicate that differ- 
ent components and different time segments of the ER change differentially. 
There must be two or more systems or neuroanatomical pathways that determine 
the wave form configuration of the evoked response to account for the repeatedly 
observed changes in ER components differentially determined by a number of 
conditions—physiological, psychological, pharmacological, and developmental. 
So, while we must abandon the notion that there is a parallel projecting nonspecific 
system, late components could reflect activity arising from this system but 
activated serially through corticifugal fibers deriving from the primary area or its 
immediate associational areas. Once activated in this manner through primary 
cortex, the nonspecific system could then project actively to the cortex in the 
diffuse manner observed. The evidence does not permit us to distinguish this 
possibility from others involving corticifugal and commissural conduction. 


Sensory Receiving 


ERs faithfully reflect the nature of the evoking stimulus. Not only do different 
sensory modalities evoke different responses, i.e. VERs, AERs, and SERs are 
uniquely different, but qualitative aspects of each stimulus may also be reflected in 
the wave form configuration of the ER. For instance, the VER changes differen- 
tially to patterned light, colored light, etc. ERs also change to “meaningful” 
stimuli. By an arbitrary separation ERs may be classified as those reflecting 
“qualitative” aspects of the stimulus, such as pattern or color, and those reflecting 
psychological state or “meaningfulness.” This section reviews those studies in 
which ERs are thought to reflect characteristics or qualities of the stimulus. 
The section on information processing reviews those studies that deal mainly with 
psychological variables. 


VISUAL EVOKED RESPONSES Possibly because it was the first to be described (71, 
92), the VER has received more systematic attention than other ERs. Despite the 
variety of techniques, electrode placement, stimulating procedures, etc, there is a 
surprising agreement in the literature as to the wave form configuration of the 
VER. Schematic comparisons of mean VERs obtained from different laboratories 
appear in several reviews (33, 218, 265). Comparisons derive from investigators 
using a flash stimulus of brief duration and diffuse light (10, 63, 71, 110, 138, 182, 
238, 259, 301, 302). In these studies from 6 to 9 peaks or components were 
described. Serious effort has been directed towards reliable identification of the 
peaks of the VER. In one study (153) wave form patterns evoked by 4 intensities of 
flash were studied in relation to intersubject variability. An algorithm was 
developed which identified, with 67-100 percent accuracy, instances when a single 
peak event occurred within the time range of each of 6 peak distributions, 
indicating that there are at least 6 reliable components. 
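
The text does not describe the algorithm of (153), so the following is only an illustrative sketch of the general idea: for each latency window, report a peak latency when exactly one local maximum falls inside the window. The window limits, the sampling interval, the waveform, and the function names are all hypothetical.

```python
def local_maxima(wave):
    """Indices of samples strictly greater than both neighbours."""
    return [i for i in range(1, len(wave) - 1)
            if wave[i - 1] < wave[i] > wave[i + 1]]


def single_peak_windows(wave, sample_ms, windows):
    """For each (start_ms, end_ms) window, return the peak latency in msec
    if exactly one local maximum falls inside it, otherwise None."""
    peak_latencies = [i * sample_ms for i in local_maxima(wave)]
    result = []
    for start_ms, end_ms in windows:
        inside = [p for p in peak_latencies if start_ms <= p < end_ms]
        result.append(inside[0] if len(inside) == 1 else None)
    return result


# Hypothetical averaged response sampled every 10 msec.
wave = [0, 1, 3, 1, 0, 2, 1, 2, 0, -1, 4, 0]
windows = [(0, 40), (40, 90), (90, 120)]   # assumed latency ranges in msec
print(single_peak_windows(wave, 10, windows))
```

In this toy record the middle window contains two maxima and is therefore rejected, whereas the first and last windows each yield a single latency. A practical version would also need smoothing and a separate pass for negative-going peaks.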

Brightness, frequency and movement all affected the VER in unique ways. La- 
tencies of VER components decreased with increased intensity. This held true for 
components peaking up to 500 or more msec after the stimulus (272, 294, 295, 306, 
321). With increasing intensity the peak to trough amplitude of VER components 
increased but leveled off and even decreased when higher intensities were reached 
(8, 66, 93, 239, 294, 295, 303). A single moving spot of light elicited a VER (11-13) 
as did moving patterned stimuli (195). The amplitude of the VER elicited by 
movement was found to be a function of the change of the angular velocity of the 
stimulus. Steady-state VERs, visual responses elicited by sinusoidally modulated 
light (67-69, 208, 230, 232, 299, 300), contributed a new and promising dimension 
to ER research. The response to this form of stimulation was roughly sinusoidal, 
following faithfully the frequency of the undulating light within certain limits. 
Intensity, frequency, spectra, and other variables were investigated. Regan (233) 
provides a scholarly, in-depth review of these techniques. 
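
Since the steady-state response is roughly sinusoidal at the frequency of the modulated light, one simple way to quantify it is the amplitude of the Fourier component at that frequency. The sketch below is an assumption about how such an analysis might be set up, not the method of the cited studies; the sampling rate, the record, and the function name are hypothetical.

```python
import cmath
import math


def amplitude_at(wave, sample_rate_hz, freq_hz):
    """Amplitude of the sinusoidal component of `wave` at freq_hz,
    computed as a single-bin discrete Fourier transform."""
    n = len(wave)
    acc = sum(x * cmath.exp(-2j * math.pi * freq_hz * k / sample_rate_hz)
              for k, x in enumerate(wave))
    return 2.0 * abs(acc) / n


# Hypothetical 1-sec record at 200 samples/sec: a response of amplitude 3
# at the 10-Hz stimulus frequency plus an unrelated 17-Hz rhythm.
fs = 200
wave = [3.0 * math.sin(2 * math.pi * 10 * k / fs + 0.4)
        + 1.5 * math.sin(2 * math.pi * 17 * k / fs)
        for k in range(fs)]
print(round(amplitude_at(wave, fs, 10.0), 3))
```

The estimate is exact here only because the record holds a whole number of cycles of each component; with real recordings the record length should be matched to the stimulus period or a window applied.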

This situation piqued the curiosity of some psychiatrists, notably Callaway (51), 
who found individual differences in the steady-state response. Subjects could be 
divided according to their response to sinusoidal photic stimulation. There were 
those who responded with augmented responses and those whose responses were 
reduced with sinusoidal stimuli. 


Pattern and contour Spehlman (279) noted marked differences between VERs 
evoked by a uniform field and those by a patterned field. A positive wave 
peaking at 180-250 msec increased markedly in amplitude with patterned stimuli. 
It was noted that a critical factor appeared to be the density of contrast borders or 
the check-size of the pattern; the greater the contrast, the larger the response (118, 
155-157, 319, 320). In addition to the enhancement of a late wave, it appeared that 
when VERs to a uniform stimulus field were compared with VERs to a highly 
contrasted patterned field, the polarity of a prominent positive wave at about 100 
msec reversed, becoming negative without any apparent change in latency (155, 
240). This polarity could be reversed merely by changing the retinal location of the 
patterned stimulus or by defocusing the pattern itself (155). Patterns with corners 
evoked larger VERs than striped patterns (194, 240). These studies have provided 
the basis for a possible method of refraction, particularly with children. Amplitude 
and configuration of VERs to patterned light will indicate when the image is in 
best focus. 


Color and spectra Shipley, Jones & Fry (271-273) reported that the wave form 
of the ER to various spectra showed color-specific differences which persisted over 
a wide range of intensities. They not only equated the luminance of the different 
colored stimuli by heterochromatic flicker photometry, but varied intensities over 
a range of 5 log units. Clynes (66-70) extracted different color-specific components 
of the VER and reported that distinguishable ER wave forms were specific to 
different stimulus colors. When red, green, and blue stimuli were abruptly 
switched on, different wave forms were evoked. Subjects with normal color vision 
showed a gradual change in their VER wave form as the wave length was pro- 
gressively changed from 680 nm (deep red) to 380 nm (far violet). Three distinct 
ERs could be recognized by purely visual inspection. Certain components were 
missing or distorted in color-blind subjects, depending on the type of the disorder. 
Other investigators have confirmed and extended these findings (9, 58, 64, 229, 
231, 234, 241). 

VERs to word stimuli have been studied by several investigators who could not 
unequivocally show that VERs to word stimuli could be distinguished from 
responses to nonword or nonsense syllable stimuli equated for brightness (29, 30, 48, 
49, 174, 188). Although a difference was reported, Shelburne (270) had an alter- 
nate explanation. The difference that did occur was generally an increased posi- 
tivity, maximal at latencies of 450 to 500 msec after the stimulus presentation. As 
the CNV usually terminates with a positive wave at about this time, it seemed 
possible to Shelburne that the CNV was involved in such experiments. 


Species differences The VERs of various species are unique. Rodents (rat, guin- 
ea pig, rabbit) all demonstrate markedly similar VERs, as do various strains of 
cats; VERs of primates, various monkeys and man, are strikingly similar (76-79, 
141, 178, 236). Within a genus there may be notable differences (76, 77, 79) that 
may influence experimental interpretation, e.g. the VERs of albino and hooded 
rats are different. The VERs differ because the visual system of the albino rat is 
defective, as are those of other albino species (77). Differences between VERs in 
strains of rats probably account for the apparent conflict in a number of studies, as 
these animals are so extensively used in perceptual research (77). 


Changes with maturation and aging VERs change markedly during infancy 
(119-122, 125, 129, 167, 243, 244), throughout maturation and development (19, 
22-24, 28, 112, 113, 115, 116), and during aging and senescence (257, 258, 265). As 
the ER may change twofold or more in amplitude at different age levels, the need 
for developmental normative data before clinical application is obvious. Appar- 
ently there are hereditary factors in the VER, as monozygotic twins demonstrate 
almost identical responses while dizygotic twins do not (108, 111, 186, 215, 326). 


AUDITORY EVOKED RESPONSES (AERs) The vicissitudes of the AER have been the
most numerous of all the responses. Efforts to define the various components (86, 
87, 89, 90, 146, 224, 225, 247) were frustrated because earlier components could be 
recorded from posterior neck muscles and were presumed to be myogenic, the 
result of reflex muscle contraction to sound. As this came to be generally accepted 
(33, 89, 90, 144), investigators directed their attention to later components, the 
most prominent of which were the N1 and P2 deflections with latencies of 95 to 105 
and 170 to 200 msec, respectively (14-17, 86-90, 224, 225). Late components were 
studied in some detail with a view to applying the findings to audiology. It was 
reported that latency of these waves did not vary greatly with a change in intensity 
of the stimulus, nor were the latency and amplitude of these waves affected by 
changes in frequency between 300 and 4800 Hz. It was reported that the interval 
between stimuli must exceed 6 sec for recovery of maximal amplitude. Stimulating 
every second, as is often done, reduced the amplitude of the AER to one-quarter 
maximum (89, 90). 

By reducing the intensity of the auditory stimulus and by averaging thousands 
of single responses, new insights into the components of the AER have emerged 
(165, 172, 221-223). We can now say with some certainty that early components of 
the AER, of which there are 6 in the first 8 msec of the response, probably 
represent activation of cochlear and auditory nuclei in the brain stem. The middle 
latency components, probably 5 in number, occurring between 8 and 50 msec,
represent activation of both thalamus and auditory cortex, but can reflect con- 
current scalp muscle reflexes. The long latency component, occurring after 50 msec 
and commonly known as P1-N1-P2-N2, represents widespread activation of the
frontal cortex and derives from a source other than that responsible for the earlier 
components. 
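
The gain from averaging many single responses can be illustrated numerically. What follows is a minimal sketch, not the procedure of any study cited here: it assumes a fixed, hypothetical waveform buried in independent Gaussian noise and shows the residual error of the average shrinking roughly as the square root of the number of responses averaged. The waveform, the noise level, and all function names are illustrative assumptions.

```python
import math
import random


def simulate_trial(signal, noise_sd, rng):
    """Return one noisy single response: the fixed waveform plus Gaussian noise."""
    return [s + rng.gauss(0.0, noise_sd) for s in signal]


def average_trials(trials):
    """Average the responses point by point across trials."""
    n = len(trials)
    return [sum(samples) / n for samples in zip(*trials)]


def rms_error(estimate, signal):
    """Root-mean-square difference between the average and the true waveform."""
    squared = [(e - s) ** 2 for e, s in zip(estimate, signal)]
    return math.sqrt(sum(squared) / len(signal))


rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical evoked waveform: 50 samples of a damped oscillation (assumption).
signal = [math.exp(-t / 15.0) * math.sin(t / 3.0) for t in range(50)]
noise_sd = 10.0  # background activity far larger than the response (assumption)

errors = {}
for n_trials in (1, 100, 2500):
    trials = [simulate_trial(signal, noise_sd, rng) for _ in range(n_trials)]
    errors[n_trials] = rms_error(average_trials(trials), signal)

for n_trials, err in errors.items():
    print(f"{n_trials:5d} responses averaged: residual RMS error {err:.2f}")
```

Under these assumptions a response one-tenth the size of the background noise is lost in a single trial but emerges clearly once a few thousand trials are averaged, which is why low-intensity stimuli demand such large numbers of responses.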


The AER to speech stimuli It has been proposed that there are specialized 
neuromechanisms required for the perception of speech and, as is well known, 
speech is lateralized to the left hemisphere in most people. Since patterned light 
evokes unique VERs, what are the effects of patterned sound, speech, on the AER? 
Three studies emerged recently, reporting that the AER to speech stimuli dem- 
onstrated hemispheric differences that other auditory stimuli did not (197, 210, 
325). Space permits mention of only one strategy (325). Stop consonants, Ba and 
Da, equated for identical stimulus qualities, were used in two different tasks. In 
one instance the subject was to judge the frequency of the two stop consonants, that 
is, Ba or Da were delivered at two different pitches. In the other, linguistic content 
was the critical variable, i.e. the subject was to distinguish whether the stop 
consonant was Ba or Da. The two situations produced different AERs, but only 
linguistic content resulted in AER asymmetries with larger responses in the left 
hemisphere. 


SOMATOSENSORY EVOKED RESPONSES (SERs) The wave components and distribution
of the SER over the scalp have been carefully studied by Goff, Shagass, and
others (1, 5, 142-145, 265-268). On the basis of the scalp distribution of the SER 
and other factors, Goff’s group has divided the components into a schema of 5 
epochs. The earliest epoch contained wave components which were very small or 
absent on the side ipsilateral to stimulation. These consisted of component 1, a
positive-negative-positive sequence with peaks at 16, 20 and 25 msec, respectively; 
component 2, a positive peak at about 30 msec; component 3, a positive peak at 
about 50 msec. The later components, 4 and 5, were bilaterally represented with 
component 4 comprising two peaks, a negative at 65 and a positive at 85 msec. 
Finally, component 5 comprised 2 peaks, a negative peak at 135 and a positive 
peak at about 200 msec. 

When SERs of blind and sighted subjects were compared (128), the SERs of the 
blind had markedly shorter latencies. A late negative component arriving at about 
150 msec in the normals’ SERs arrived approximately 20 msec earlier in SERs of 
the blind subjects. Another interesting observation (253) was that a procedure 
related to acupuncture (electrical cutaneous stimulation) markedly attenuated 
SERs for as long as 20 min. 


EVOKED RESPONSES TO OTHER STIMULI Olfactory evoked responses have been 
recorded from the human (3, 129a, 278a) and quite extensively from the cat and 
rabbit (134). The olfactory ER was reexamined (278a) and stimulation of nasal 
trigeminal afferents found to be a principal source of the “olfactory” ER. Electrical
stimulation of the tongue elicits a sensation of taste and also an ER (135).
Vestibular ERs were reported (148). 


CONTINGENT NEGATIVE VARIATION (CNV) The slow negative shift of brain electrical
potential that bridges the temporal gap between two stimuli, S1 and S2, when
the first acts as a warning and the other as an imperative stimulus, has been termed
the Contingent Negative Variation (CNV). Generally an alerting or warning
signal precedes the delivery of an imperative signal to which the subject must
make some kind of response. The interval between S1 and S2, usually 1 to 2 sec, is
adequate for the full development of the slowly rising negative wave. In the
decade since this phenomenon was first described and elaborated (205, 311-315),
there has been an avalanche of studies, but still no consensus has emerged
concerning the psychological concomitants of the CNV. Different viewpoints can be
summarized in terms of four hypotheses or interpretations: expectancy (74, 312,
314); motivation (168, 228); conation (189, 190, 192, 198); and attention (161, 201,
292, 293).

A pitfall in analysis of the CNV is involuntary eye movements during the
S1-S2 interval (164, 190, 285). Source and topographic distribution of the CNV
was originally presumed to be in anterior regions of the frontal cortex, spreading 
posteriorly to precentral motor areas (313, 315). Järvilehto & Fruhstorfer (170), 
however, were able to differentiate three slow potentials (SPs) by manipulating the 
tasks of subjects. Borda (41) recorded from frontal and central cortical regions in 
the rhesus monkey and reported that SPs recorded from each of these areas 
occurred independently. Rebert (226) confirmed this finding and reported that 
frontal SPs were larger and more consistent than those found in the motor areas. 
Others (105, 245) studied the topographical distribution of the CNV in the monkey 
and reported a frontal-precentral or a postcentral slow potential depending upon 
the nature of the task. 

Numerous studies suggest that the CNV is symmetrically distributed over the 
two hemispheres (189, 190, 196, 314). However, there have been claims for slightly 
larger SPs over the hemisphere contralateral to the hand used for a response (216), 
or over the dominant hemisphere in experimental designs involving numerical 
stimuli and arithmetic operations (50). McCallum & Cummings (202) investigated 
the effects of brain lesions on the CNV and reported convincing correspondence 
between asymmetry of the distribution of the CNV and the site of the lesion in 
those cases where the lesions were localized. With more diffuse lesions there was a 
generalized suppression of the CNV. 

Skinner & Lindsley (276, 277) inserted cryogenic probes into midline anterior 
thalamus of experimental animals. During transient cryogenic blockade of this 
area, slow potentials were abolished; VERs and SERs were enhanced. Further 
evidence that the diffuse nonspecific thalamic system may play some role in the 
genesis of the CNV was provided by Hablitz (150), who demonstrated that CNV 
potentials were seen earlier from leads implanted in nonspecific thalamic nuclei 
than from those in cerebral cortex. This relationship, however, was not true in all 
situations. 



Information Processing, Psychological Variables 


In this area there were daring excursions not only into the brain but into the mind. 
Cerebral potentials recorded from gold electrodes implanted in the human brain 
were related to “real” and “imaginary” stimuli (317). Extrasensory cerebral po- 
tentials were reported to be passed from one identical twin to the other (107). 
Despite a number of excellent experiments, there was such a welter of techniques 
and subjective description that one had difficulty separating results from interpretation.
Encountered were “information delivery,” “read-out components,”
“bias level,” “decision making,” “expectancy waves,” and 20 or more anthropo- 
centric descriptions. One would not be greatly surprised to encounter a slow rising 
late wave of “brotherly love.” Despite these difficulties, evidently this is where the 
action is for psychologists. Possibly we are moving, as some have claimed, towards 
a technique that will provide a “window to the mind.” 


ATTENTION, HABITUATION, SELECTIVE ATTENTION A number of investigators 
have reported that attending, learning, and other mental processes may be 
reflected in the size and wave form of the cerebral evoked response. During 
inattention or during habituation the response has been reported to diminish, 
while during attention it has been observed to increase in size. For example, 
attenuation of the ER in the visual system to a repeated presentation of light 
(habituation) has been reported all along the visual pathways from the optic tract 
to the primary visual cortex, implying some inhibitory influence at all levels of that 
system. Similarly, attenuation of the auditory evoked response to a repeated 
click has been reported in the auditory pathways from the dorsal cochlear nucleus 
to the auditory cortex, and somatosensory ERs declined as a function of time in the 
medial lemniscus, the ventral posterolateral nucleus of the thalamus, and the 
somatosensory cortex (1, 39, 40, 62, 137, 160, 193, 206, 219, 296, 316). Likewise, 
with increased attention to an auditory stimulus, such as a conditioned response to 
a click, enhancement of the auditory response has been reported at dorsal cochlear 
nucleus, medial geniculate body, and auditory cortex. Increased attention to visual 
stimuli altered the form and amplitude of VERs recorded along the visual path- 
ways including the lateral geniculate and the striate and parastriate cortices (27, 
151, 158, 160, 169, 173, 220, 246, 249, 251, 260-63). ERs were also reported to be 
enhanced in human subjects when they were attending to a click (85, 149, 165, 166, 
171a, 214a, 221, 222, 280, 323) or to a flash (27, 117, 137, 152, 156a). In the majority 
of these experiments the late components were most affected. 

When two stimuli were presented simultaneously and subjects were asked to 
attend to one and ignore the other, the amplitude of the late components to the 
ignored stimulus was attenuated while the opposite effect was noted with the 
response to the attended stimulus (60, 61, 101, 152, 254, 255, 280). Näätänen (214) 
and Karlin (176) argued a noncognitive explanation for these results. It will be 
recalled that when changes occurred during attention these changes were confined 
mainly to the late components of the ER. It is well known that during periods of 
increased arousal these components change. As heightened states of arousal enhance
components of the ER, Karlin and Näätänen pressed the point of view of a
“prior-state hypothesis.” That is, as stimuli in most of the studies were presented in 
a more or less orderly fashion, the subject had time to prepare for the critical 
stimulus, and thus enhancement was due to a level of increased arousal, not to 
selective attention, hence noncognitive. They also argued a “reactive change 
hypothesis,” which meant that after the critical stimulus had occurred and the 
noncritical stimulus was presented the subject was in a relaxed state and hence 
there was attenuation of these components. These arguments were disturbing but 
valid. A flurry of disagreement found its way into the literature (102, 103, 212, 213) 
and was summarized by Regan (233) with a characteristically British comment: 
“This exchange of letters serves to confirm that any ambiguity or imprecision in 
defining the psychological terms used in evoked response research is a most potent 
source of misunderstanding and confusion.” Even prior to the admonitions of 
Näätänen and Karlin, some experiments had been reported that met their objections
(27, 117, 251).

Recently the effects of selective attention on both VERs and SERs have been 
investigated with conditions regarding transient changes in arousal level and 
peripheral orienting factors reduced to a minimum, with stimuli presented in a 
random and unpredictable sequence, and time epochs studied which could not be 
related to the effects of CNV. Harter & Salmon (156a) investigated the effects of 
selective attention on VERs. When the stimulus was attended, a reliable en- 
hancement of a surface negative wave peaking at about 220 msec and a surface 
positive wave at about 315 msec occurred. Ford et al (132a) extended this ap- 
proach to include both VERs and AERs. They reported that a negative component 
(arriving at about 200 msec in both systems) reflected selection between sensory 
modalities, while a later positive wave, peaking after 400 msec, indicated more 
complex intramodal discrimination. In what appeared to be a decisive experiment, 
Picton & Hillyard (223) investigated the effects of attention on the AER. With 
attention directed towards auditory stimuli to detect an occasional faint signal, 
there was a significant increase in the amplitude of two late waves: N1 occurring at 
83 msec, and P2 occurring at 161 msec (mean time). A striking effect in these 
experiments was the appearance of a large “P3” component at about 450 msec in 
response to the detected signals. This was topographically distinct from the pre- 
ceding components and could be recorded only in response to a detected stimulus 
or a detected omission of the stimulus in a regular train. From this it appeared that 
two systems are operating in selective attention, one indicated by the increased 
amplitude of the N1-P2 waves, the second reflected in the later more widespread 
P3 complex. The first may involve the selection of a particular input channel to be 
examined, while the second may indicate information processing. It should be
noted that these last two studies (132a, 223) are in close agreement. 


INFORMATION DELIVERY—STIMULUS UNCERTAINTY, THE P3 WAVE Sutton, Brar- 
en, Tueting, and Zubin (289,290) were the first to describe the late positive 
component that was uniquely sensitive to what appeared to be the internal state of 
the subject. This was a slow (250-500 msec) positive wave similar to the vertex 
wave originally described by Davis (88). Sutton described the late positive wave as
occurring around 300 msec and hence referred to it as P300, later to be called the
P3 wave. The P3 wave appeared to be far less closely coupled to the evoking 
stimuli than any other wave. In fact, the absence of a stimulus could evoke the P3 
wave, provided it was attributed to the “resolution of some uncertainty” (290,317). 
An exciting example of this was found in recordings made from intracerebral 
electrodes in five patients under treatment for chronic obsessional and anxiety 
disorders (317). Clear cerebral electrical potentials termed “omitted potentials” 
were observed when stimuli were expected but did not occur. These “omitted 
potentials” resembled those evoked when the real stimuli were presented. 

The P3, occurring during a similar time epoch as the CNV, has been thought by 
some to be a manifestation of CNV activity. However, Donald & Goff (97) have 
demonstrated that these two phenomena, the P3 and the CNV, are dissociated. 

Since the P3 has been found to be exquisitely sensitive to a variety of experi- 
mental maneuvers, it has excited considerable research and a variety of opinions. 
Defined operationally, an enhanced P3 may indicate: an occasional stimulus 
change (179, 222, 242, 278); a signal for a simple or choice reaction (106, 322); the
detection of a threshold level auditory signal (166, 217, 281); a signal which 
confirms or refutes a prior guess (289, 290, 298); a signal to shift motor set (97); and 
finally a letter which completes a word (270). A number of formulations of the 
psychological processes that accompany the P3 have been offered. To this reviewer 
the most acceptable is that from the studies of Hillyard & Picton (165, 221, 222). 
The P3 wave is enhanced only when stimulus information is being actively pro- 
cessed and is uniquely associated with the occurrence of a signal and its correct 
detection. 


PSYCHOLOGICAL VARIABLES AND THE CNV The magnitude of the CNV was found
to be dependent upon: qualities of the stimulus such as the intensity of S2, the 
imperative stimulus (166, 191, 228); uncertainty in the delivery of S2 (190, 312); 
distraction (205, 293); information contained in S2 (312, 313); duration of the 
S1-S2 interval as well as reaction time (162, 163, 199, 293); effort expended (41, 
192, 227); response accuracy (162, 191, 198, 200); concentration and anxiety (180, 
191, 198); sex differences (180); and finally psychopathology (42, 202, 204, 205). 
Results from nonhuman primates (41, 105, 150, 207, 226) generally support the 
conclusions drawn from some human studies (57, 170) that (a) the CNV is not a 
single process, and (b) that its nature and cerebral topography are dependent upon 
the state of the organism and the task imposed. 


SLEEP, ANESTHESIA, HYPNOSIS, AND ALTERED CONSCIOUSNESS VERs, AERs, 
and SERs have been investigated during various stages of sleep. They were found 
to change during sleep in man (45, 46, 138, 143, 182, 248) and in animals (2, 4, 6, 7), 
yielding responses that characterized the level of sleep. Late components of ERs 
were generally the waves most affected. Reviews of sleep research and sleep 
disorders by Broughton (45, 46) provide additional detail and pertinent back- 
ground. The evolution of sleep received careful and systematic attention in 
symposia and papers (2, 6, 7), with reports indicating that “slow wave” and
“paradoxical” sleep first developed phylogenetically with birds and mammals in
parallel with the evolution of homeothermy, possibly as a mechanism to minimize
energy expenditure. The effects of anesthesia on the VER were reviewed by 
Brazier (43), and effects of general anesthetics on VERs, AERs, and SERs in man 
and animal by Clark & Rosner (65). 

Hypnosis, initially reported to have marked effects on the amplitude of the ER 
(69, 160), was found with larger groups of subjects to effect no reliable change 
other than increased stability (20, 25, 154). The ER appears to be a more accurate 
measure of cerebral death than the conventional EEG. In both man (297) and 
animals (21, 31) the ER was found to be more durable than the EEG during 
periods of serious cerebral embarrassment and more predictive of the time of
“brain death.” This technique may prove valuable in determining cerebral death 
in the presence of ambiguous cerebral electrical activity, particularly with sus- 
pected death due to barbiturates and other depressants (21, 31, 297). 


INTELLIGENCE AND MENTAL RETARDATION Ertl and co-workers (18, 59a, 126a) 
reported reliable and significant relationships between latency of certain VER 
components and IQ test measures. The peak delays of components of subjects with 
high IQ arrived significantly earlier than did those of subjects with low IQ. The 
differences were large, often more than 100 msec (59). Reported Pearson r correlations
were equally convincing (18), ranging between -.70 and -.80 for measured IQ
and latency of some waves. While these data suggested that the ER was a prom- 
ising, reliable measure of “biological intelligence,” there has been a general 
inability of others to replicate them (24, 55, 56, 114-116, 235, 259). Although
Shucard & Horn (274, 275) correlated 16 ability tests with measurable VER
components, few significant correlations emerged, none beyond .25. In another 
approach (235) two groups of 20 children matched for age differed in mean group 
IQ by more than 50 points. VERs of the groups differed reliably in amplitude but 
not in latency. Amplitude of certain components and hemispheric asymmetry 
differentiated the groups, brighter children having larger overall responses and 
asymmetry characterized by larger responses in the right hemisphere. As higher 
IQ children were more alert than those of lower intelligence, this difference could 
be due to enhancement of late components because of increased level of attention. 
Engel & Fay (126) investigated the possible relationship of VERs derived in 
neonates with IQ measurement at ages 3 and 4 years. Studying 828 infants, they 
found some indication that the latency measurements of the neonatal VER pre- 
dicted certain motor development at 3 and 4 years, but there was no correlation 
between latency of VER and subsequent intellectual development. Schafer & 
Marcus (256) reported that when the stimuli evoking a response were self- 
administered, both AERs and VERs reduced in amplitude and later components 
arrived earlier. This effect, greater with AERs than VERs, was more marked 
with the brighter subjects. Scientists, technicians, and institutionalized retar- 
dates were compared. 

Mental retardation has been an area of interest to ER researchers. Barnet and
co-workers (14-17), noting the differences in normal and developmentally re- 
tarded newborns, reported that click evoked responses were significantly larger in 
infants with Down’s syndrome than in matched normal infants. Others have 
studied VERs of older retarded children (37, 136, 237). Beyond longer latencies 
and missing components in the responses, the most striking difference was that 
reported by Bigum, Dustman & Beck (37). The SERs of mongoloid children not 
only differed from normals but were sufficiently unique to be recognized on 
relatively casual inspection. Somewhat like the disorder itself, characteristics of 
the response were strikingly similar among the Down’s syndrome children. 


EVOKED RESPONSES AND ANIMAL LEARNING; OTHER ELECTRICAL ACTIVITY AND 
BEHAVIOR John (173) demonstrated “generalization” of ERs. Cats trained to one 
task utilizing a stimulus S1, when learning another similar task to stimulus S2,
yielded ERs with periodicity and waves common to both S1 and S2.

While hippocampal theta activity has been previously associated with arousal, 
attention, and orienting behavior, lately remarkable correlations have been shown 
with the rate of theta rhythm and bursts of sniffing and associated vibrissal 
movement in the rat (147, 181). This implies that theta rhythm is associated with 
movement related to orienting, not orienting itself. Eye movement (250, 252), and 
other activity such as exploratory behavior, drinking, grooming, crouching, etc, 
also showed correlations with hippocampal frequencies (147, 181, 250, 252, 
260-264). Schwartzbaum & co-workers (260-263) have related VER amplitude, 
behavioral reactivity, and hippocampal theta in the rat. The animals were con- 
fronted with two situations: in one the series of flashes signaled no contingency; 
and in the other the flashes indicated the delivery of food. In the first, wherein the 
animals became habituated, amplitude of VERs increased and hippocampal 
rhythms shifted to lower frequencies. In the food delivery situation VERs were 
reduced in amplitude and hippocampal activity increased in frequency. Other 
excellent animal studies utilizing the ER as an electrophysiological correlate to 
various behaviors were reported (151, 169, 178, 220, 242, 249). Among the studies 
cited (151, 178, 220) an interesting phenomenon was repeatedly observed. A large 
enduring rhythmical after-discharge of several waves was produced by iterative 
flashes, mainly when the animal was “relaxed.” Fleming & co-workers (130-132)
applied this reliably occurring phenomenon to the evaluation of drug effects on 
CNS activity, reporting that those drugs associated with physiological arousal, 
such as amphetamine, pilocarpine, and physostigmine, suppressed the after- 
discharge. When anticonvulsant drugs were evaluated, it was found that di- 
phenylhydantoin (Dilantin) had little effect, while those such as trimethadione, 
generally used in the control of petit mal, greatly attenuated the after-discharge. 

Brief mention should be made of two other electrical patterns, the 40 Hz and the 
12-14 Hz rhythms, that lately have been related to behavior. Recording from cats
implanted with cortical and subcortical electrodes, and analyzing the cerebral 
electrical activity by frequency analyses, Sheer (269) reported that after a criterion 
level of performance was achieved, there was a repeatedly marked increase in 40
Hz activity at the time the animals made a “relevant” response. This he interpreted 
to be an electrophysiological correlate of memory consolidation. While Mulhol- 
land (211) and others have demonstrated the ability of human subjects to modify 
the amount of alpha activity in their EEGs, Sterman & Friar (282) demonstrated a 
unique application of this technique, claiming a reduction in the incidence of 
seizures in an epileptic when the patient was trained to produce rhythmic activity 
over the sensory motor area. This application evolved from studies of a rhythm, 
12-14 Hz, occurring over the sensory motor cortex of the cat, seen during volun- 
tary suppression of movement (283). The cat could be “shaped” to produce such a 
rhythm. Furthermore, Sterman & co-workers have been fairly successful in tracing 
out the source of this rhythm which they believe to be generated by neurons within 
the ventral basal thalamus. 

At the time this review was submitted, an International Symposium on Cerebral 
Evoked Potentials was in progress in Brussels. Volumes will be published on new 
findings of the more than 100 contributors. These will be edited by Professor John 
E. Desmedt and will cover new aspects of both ER and CNV research. 

As may be seen, there has been an avalanche of research since Dawson (91) 
opened the door for such techniques. Clinical and other applications are developing
rapidly (26, 33, 52, 54, 202, 233, 265). A future reviewer of this area will no
doubt be as impressed as this reviewer with the diversity and extent of the 
technique. 

The field of perception reaches to the corners of the psychological landscape.
Therefore, this review, which primarily covers the literature published during 
calendar years 1972 and 1973, is unapologetically idiosyncratic. The topic is visual 
perception. Color vision and perceptual development receive frequent treatment 
in Annual Review chapters, so these topics will be slighted. Sekuler’s (172) 
admirable chapter in last year’s Annual Review of Psychology covering “spatial 
vision” permits me to tread lightly across that ground. 

I have surveyed the remaining terrain like a helicopter pilot. The simile is apt 
because helicoptering allows one to pull back in order to appreciate the panorama, 
hover low for closer observation, alight to reconnoiter, but it does not permit a 
microscopic examination of every square centimeter of soil. Any pilot is well 
acquainted with the territory near home base; he is more likely to know only 
landmarks farther out. The territory was found to be wide, varied, and interesting. 
However, I must report that there’s a jungle down there, trails that lead nowhere 
overgrown with tangled roots, vegetation that is perpetually green but never 
flowers, and unfriendly tribes separated by nearly impenetrable forest. The tribes 
skirmish only occasionally because they seldom venture from their own clearings, 
which they stand ready to defend at a moment’s notice. 

Two large tribes will be described before the analogy is abandoned. There are 
neurophysiological theorists and psychological-process theorists (functionalists). 
The former deal in flesh and blood mechanisms (e.g. neural lateral interactions), 
the latter in descriptive labels for psychological functions (e.g. size comparator), 
the labels sometimes enclosed in rectangles and connected by lines to form a flow 
diagram, irreverently termed the “Boxology” by a charming Ms. who is a fellow 
tribesperson. 

The tribes can be distinguished by language, habits, and lifestyle. A truly 
remarkable distinguishing feature discovered from this survey is the number and 
character of human subjects deemed appropriate for a psychophysical experi- 
ment. One tribe characteristically tests two or three subjects, and often only one. 
These few subjects are considered as representative, and it is common for multiple 
authors of a research report to have tested each other. The other tribe strives for 
large samples, tries to estimate population variability, and employs experimenters
as subjects with great caution fearing experimenter and subject bias. Any investigator
can be categorized as to tribe on the basis of a simple test. Compute the
gator can be categorized as to tribe on the basis of a simple test. Compute the 
median number of human subjects used per experimental condition as deter- 
mined from the investigator’s published research. The criterion cut for deciding 
the theoretical orientation of any investigator lies near 5; functionalists lie above 
the cut. 
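
The simple test just described can be written out in a few lines. The sketch below is illustrative only: the subject counts are invented, the function name is hypothetical, and because the text places the cut only "near 5" the exact value, along with the treatment of a median falling exactly on the cut, is an assumption.

```python
from statistics import median

# The text says the cut "lies near 5"; using exactly 5 is an assumption.
CRITERION_CUT = 5


def classify_investigator(subjects_per_condition, cut=CRITERION_CUT):
    """Assign a tribe from the median number of subjects per condition.

    Functionalists lie above the cut. A median at or below the cut is
    assigned to the neurophysiological tribe (an assumption for ties).
    """
    m = median(subjects_per_condition)
    return "functionalist" if m > cut else "neurophysiological"


# Invented publication records: subjects per condition in each experiment.
small_n_record = [1, 2, 3, 2]
large_n_record = [12, 20, 8, 30, 16]

print(classify_investigator(small_n_record))
print(classify_investigator(large_n_record))
```

The median rather than the mean is used, as the text specifies, so that a single unusually large or small study does not shift an investigator across the cut.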

Each group finds the language of the other not to its liking. A potentially more 
serious failing is that each tends to ignore the evidence developed by the other. 
Fortunately, both kinds of theorists and theories are thriving. Unfortunately, 
apartheid stands in the way of advancing the science of perception. 


DRAWING INFERENCES FROM DATA 


Brindley (25) proposed an hypothesis linking experience to its neural substrate 
“...namely that physically indistinguishable signals sent from the sense organs to 
the brain cause indistinguishable sensations” (p. 134). Brindley proposed that 
psychophysical procedures requiring judgments of indistinguishability are highly 
to be preferred. Data from matching and threshold determinations were labeled as 
Class A observations. Null-match methods do have high face validity for investi- 
gating neurophysiological hypotheses. Stimuli that are indistinguishable might be 
presumed to have identical neural concomitants [which is the converse of Brind- 
ley’s statement, as Boynton & Onley (24) pointed out]. Indistinguishability must 
mean, however, indistinguishability in all respects. In colorimetry, for example, an 
observer's task is to produce a metameric (look alike) match. Indistinguishability 
demands further verification. The observer must be unable to detect any differ- 
ence between the metamers when they are then presented to him in a two-alter- 
native forced-choice experiment requiring a temporal or spatial discrimination. 
[Compare the above treatment of Class A observations with that of Cornsweet & 
Pinsker (33) and of Sekuler (172).] 
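
The forced-choice verification described above amounts to asking whether an observer's proportion of correct choices exceeds the chance level of one-half. As a minimal sketch, assuming independent trials and an exact one-sided binomial test (neither of which is prescribed by the sources cited), the check might look as follows. The trial counts and the function name are hypothetical.

```python
from math import comb


def p_value_above_chance(n_correct, n_trials, chance=0.5):
    """Exact one-sided binomial probability of n_correct or more successes
    in n_trials when the observer is merely guessing at the chance rate."""
    return sum(
        comb(n_trials, k) * chance ** k * (1 - chance) ** (n_trials - k)
        for k in range(n_correct, n_trials + 1)
    )


# Hypothetical outcomes of 100 two-alternative forced-choice trials.
for n_correct in (50, 58, 70):
    p = p_value_above_chance(n_correct, 100)
    print(f"{n_correct}/100 correct: P(at least this many by guessing) = {p:.4f}")
```

A large probability is consistent with indistinguishability but does not prove it; with few trials a small residual difference between the metamers could easily go undetected, so the number of trials bounds how strong a claim of a Class A match can be.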

Unfortunately, complete perceptual identity is impossible in the most inter- 
esting experiments because the stimuli are indelibly tagged by their contexts. 
In an experiment on simultaneous hue contrast, a stimulus in a chromatic 
(colored) surround might be matched by a stimulus in an achromatic (neutral) 
surround. After the match has been achieved the stimuli are nevertheless distin- 
guishable by their respective surrounds. The line segments to be matched for 
length in the Müller-Lyer display are tagged by distinguishable arrowheads. Since 
stimuli are seldom context free, null matches, of necessity, are null with respect to 
selected attributes only. Observers are asked to disregard all other differences. 
These less than ideal matches are the best that can be done when it is context that 
makes objectively equal stimuli appear unequal. [Compare with the Class IV 
observations of Boynton & Onley (24).] 

I propose that all null-match techniques be promoted to Class A status, reserving
for Brindley’s indistinguishability-in-all-respects criterion the title Class A’ to
designate their prime (i.e. high grade) quality. It requires no great leap to assume,
when two stimuli look alike under conditions where their contexts differ, that at 
some level the neural activities corresponding to the stimuli are identical. The 
important point is that no matter what the conditions, conferring neural 
indistinguishability by virtue of perceptual indistinguishability is an inference. In 
principle, only Class A’ null matches can be confirmed through forced-choice 
techniques. Even when there is no objective criterion of correct and incorrect 
choice, perceptual indistinguishability can be confirmed via forced-choice tech- 
niques. 

Regardless of class, the convergence of various psychophysical techniques is 
important. If null-match data are comparable to those obtained by ratings or by 
magnitude estimates, then confidence increases concerning what is being mea- 
sured. Suppose operations presumably assessing the same attribute disagree? 
Wagner & Boynton (207) compared several methods of heterochromatic 
photometry. They recommended that null matches for brightness (disregarding 
other color attributes, i.e. hue and saturation) be abandoned in favor of producing 
a minimally distinct border between contiguous color patches for a substantial 
number of very practical reasons. Nevertheless, such a recommendation clearly 
violates functionalist (and phenomenological) tenets. Brightness matching ad- 
dresses the very fundamental meaning of the term brightness. If two stimuli 
meeting the criterion of a minimally distinct border do not appear equally bright, 
then the tractability or reliability of the measures would seem irrelevant. 

Because they are so basic and simple, null-match techniques may lull the 
experimenter into believing that he has found the ultimate in psychophysical 
security. Not so. Always implied is the proviso that the effect has been measured 
relative to the comparison stimulus. On what basis can it be inferred that the 
comparison stimulus is perceived veridically? For example, do parallel lines at- 
tract? Quina & Pollack (163) required subjects to judge the distance between 
parallel lines by determining an equal distance between two dots. Relative to the 
dots, parallels are perceived as closer together. However, dots might repel one 
another, and parallels might also repel, but less strongly, as Weintraub & Tong 
(216) concluded on the basis of other converging evidence. 

Another serious problem is that the comparison stimulus used to “measure” an 
experimental outcome is itself being influenced by the independent variable. 
Howard, Evans & McDonald (93) showed that in a figural-aftereffects paradigm 
the inducing figure affected the location of the comparison figure as well as the test 
figure. An annular surround with only slightly greater luminance than the center 
test disk causes a drastic reduction in the perceived brightness of the disk. Yet 
Guth (76) found little or no change in ΔIs superimposed on the disk when the
surround luminance was varied. But given a base intensity (I), the just-noticeable
increase in intensity (ΔI) used as an index may covary with the independent
variable. To quote Guth,

These facts may be explained by the possibility that the inhibition that causes a
reduction in the neural activity associated with the disk or image also causes a similar
reduction in the neural activity associated with the ΔI flash. If the inhibitory reductions
occur over the range where ΔI/I is constant, and if they reduce neural activity from I
and from ΔI by the same factor, then absolutely no change is to be expected in the ΔI
required for threshold, even though the apparent brightness of the test field decreases
markedly. . . . To the extent that our measuring instrument (the ΔI or threshold light) is
affected by the same inhibition that we are trying to evaluate, our efforts may prove
useless (76, p. 952).
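
Guth's argument can be checked with a few lines of arithmetic. The sketch below is only an illustration under stated assumptions: a hypothetical linear neural response, a multiplicative inhibitory gain, and a fixed Weber fraction of 0.02. None of the names or values come from Guth (76).

```python
def neural_activity(intensity, gain):
    """Hypothetical neural response: intensity scaled by an inhibitory gain (0 < gain <= 1)."""
    return gain * intensity


def threshold_increment(base_intensity, gain, weber_fraction=0.02):
    """Return the stimulus increment (delta I) needed to reach threshold.

    Assumption: threshold is reached when the neural increment equals a
    fixed fraction of the neural activity produced by the base intensity.
    """
    base_activity = neural_activity(base_intensity, gain)
    required_neural_increment = weber_fraction * base_activity
    # The increment flash is attenuated by the same gain, so divide it back out.
    return required_neural_increment / gain


# Apparent brightness (neural activity) drops fourfold under inhibition,
# yet the increment required for threshold is the same in both cases.
uninhibited = threshold_increment(100.0, gain=1.0)
inhibited = threshold_increment(100.0, gain=0.25)
print(neural_activity(100.0, 1.0), neural_activity(100.0, 0.25))
print(uninhibited, inhibited)
```

Because the gain cancels, the threshold increment in this sketch depends only on the base intensity and the Weber fraction, which is the sense in which the measuring instrument is blind to the inhibition.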


ASPECTS OF THE WORLD AS PERCEIVED 


Evidence is accumulating that psychological plane geometry corresponding to 
operations accomplished in the real world with compass and straightedge is 
complex indeed. Perceived rectilinearity among line and/or dot elements depends 
critically upon their orientation and also eye fixation relative to the elements being 
judged (39, 153, 162). Psychological parallelism is a function of the orientation of
the pair of lines (27). Errors in the perceived size of angles depend upon the size of 
the angle, the orientation of the angle, and the technique used in assessing the 
errors (27, 91, 218). There is considerable agreement that obtuse angles are 
underestimated, but it is not so certain that acute angles are overestimated. Perkins
(154) presented line drawings of rectangular solids and asked subjects to judge 
whether the drawings represented three-dimensional rectangular boxes. On the 
basis of projective geometry, half the set of drawings could have been projections 
of a rectangular solid, the rest could not. Subjects tended strongly to judge as 
rectangular boxes only the drawings that could have been rectangular boxes. Thus 
perceivers seemed to impose perpendicularity on their angles. The Perkins 
experiment would seem to be a very fruitful one to try cross-culturally in environments
that are not so carpentered.


PATTERN An excellent tempered review of infant form perception has been 
provided by Bond (20). The notion of figural goodness has received considerable 
experimental and theoretical support in recent years from Garner, whose new 
book (60) summarizes a large body of evidence concerning stimulus structure 
existing in various types of patterns, especially in dots selected from a matrix. [A 
paper by Bear (12) has a similar orientation.] 

Rock (169), in his new book, argues that patterns encompass the notion of 
uprightness with respect to gravity. Familiar patterns possess assigned direction, 
e.g. the “top” of a form. Since gravity defines the orientation of a form for the 
perceiver, tilting the head has little effect on shape constancy, whereas tilting a 
relatively unfamiliar form will affect its phenomenal shape, even though the head 
is tilted to preserve retinal orientation. Wenderoth (14, 221) demonstrated the 
converse, that the axes of figures or intersecting lines used as contextual frames of 
reference influence judgments of apparent verticality (as in the rod-and-frame 
test). 

Schuck (171), by means of a clever series of controls, studied the fragmenting 
of visual images as subjects fixate an EB stimulus. He found that continuous verbal 
reporting during a trial produced a significant bias to report meaningful fragments
of the fixated stimulus when compared with other response methods. He was not
able to support Hebb’s hypothesis concerning the preponderance of meaningful
fragments among those reappearing. 


PATTERN ANOMALIES (ILLUSIONS) The word illusion, conveying as it does the 
unfortunate connotations of magic and deception, should probably be exorcised
from the vocabulary of perception. With the exception of the literature on pattern, 
very few perceptual anomalies are called illusions. Judged by the quantity of 
published material, pattern anomalies is one of the most popular topics in per- 
ceptual research and theory. An entire issue of the new journal Perception was 
devoted to anomalies, primarily pattern. The handbook of pattern anomalies has 
been written. It is well organized, comprehensive, up-to-date, and surprisingly
undogmatic. To find out what theoretical viewpoints prevail and what data exist, 
go directly to Robinson’s The Psychology of Visual Illusion (168)—before starting 
to experiment. 

Cowan’s (34) research on the twisted-cord effect, essentially a concentric set of 
lines that appear to be spiraling inward, represents a systematic attack on a 
relatively untouched problem. 

The literature reveals that the field has passed beyond merely demonstrating 
that an effect exists. For the most part directionality (positive versus negative 
errors, assimilation versus contrast) is also settled; magnitudes have become im- 
portant. 


Poggendorff effect A current research question that hinges upon the direction- 
ality and magnitude of errors is whether the Poggendorff display (parallel lines 
interrupting a transversal) produces a Poggendorff or anti-Poggendorff effect 
when only the portions of parallels forming an acute angle with the transversal are 
represented. A tally just for the period under review shows two papers claiming a 
small Poggendorff effect (Houck & Mefferd 94, Imai 95; both papers presenting a 
gold mine of additional parametric data as well), two claiming a small anti-Pog- 
gendorff effect (Krantz & Weintraub 105, Pressey & Sweeney 160), and one 
presenting mixed results depending upon psychophysical method among other 
things (Day 40). Granting other complicating variables like the particular acute 
angle employed, one wonders whether to disregard one or another result, or 
average the results and conclude that acute angles alone have little or no effect 
upon judgmental errors in the Poggendorff display. Fortunately, there is unani- 
mous agreement that obtuse-angle-only displays yield a strong Poggendorff effect. 
But overestimation of acute angles bears on the admissibility of certain theories like
the regression-to-right-angle tendency (Hotopf & Ollerearnshaw 91, 92). Regres- 
sion to right angles could be given a functionalist or a neurophysiological inter- 
pretation. Perceivers tend, because of a carpentered environment, to interpret any 
angle as closer to a right angle than it actually is; or contour interactions cause the 
sides of acute angles to be represented neurally as diverging more than they ought. 


Müller-Lyer effect The Müller-Lyer arrowhead display remains popular as a
testing ground for theories, which can be loosely categorized as either context or 
contour theories. Gregory (72-74) continues to defend his misapplied-constancy 
scaling theory. Context, in the form of perspective features in the display, auto- 
matically triggers a size-scaling response below the level of awareness. Georgeson 
& Blakemore (61) presented the display stereoscopically so that the arrowhead 
figure was perceived as flat or with fins tilted in depth either toward or away from 
the subject. Thus the implied depth could be reinforced or counteracted. The data 
did not support the theory; tilting the fins either forward or back reduced errors 
somewhat. Pressey’s assimilation theory (159) postulates that the stimulus context 
determines the observer’s attentive field. A more precise specification of the 
relationship between stimulus parameters and the size and location of the hy- 
pothetical attentive field would appear to be the direction in which theory refine- 
ment should proceed. Other interesting papers bearing on versions of context 
theory are Chiang (28), Coren & Girgus (32), Day (38), Fellows & Thorn (50), and 
Massaro (125). Smets & Haesen (182) suggest a clever demonstration and argue 
from it that the Müller-Lyer effect must be centrally caused. Place an identical 
vertical line in each field of a stereoscope so that they will fuse stereoscopically. 
Add inward-pointing arrowheads to the ends of one line and outward-pointing 
arrowheads to the ends of the other line. Fuse the two vertical lines stereoscopically.
If the anomaly occurs peripherally, then the vertical lines will be perceived
as unequal in length and the arrowheads will not join to form Xs. In fact, they do 
join to form Xs. 

Pollack has proposed a theory based upon brightness contrast that explains the 
decline in the Müller-Lyer effect with age as a decline in sensitivity to contours. 
Pollack & Silvar (158) reported that subjects with more dense pigmentation of the 
fundus oculi of the eye produced smaller errors. Bayer & Pressey (11) could 
not replicate that finding in a within-race design. Ebert & Pollack (48, 49), in more 
recent work, have not demonstrated a consistent relationship between ocular 
pigmentation and error magnitude when variables such as stimulus duration are 
altered. The converse of Pollack’s hypothesis can be argued on logical grounds, 
namely, that ocular pigmentation reduces stray light thereby sharpening retinal 
contrast and increasing judgmental errors. Weintraub, Tong & Smith (217) 
emphasized the role of differences in contour contrast. Increasing the contrast of 
arrowheads, while decreasing the contrast of the shaft, enhanced errors. 


Ponzo effect Misapplied-constancy scaling applies directly to a display that can 
be likened to viewing railroad tracks with segments of cross ties at different 
distances. Fisher (54) interpreted his results as supporting a contour-proximity 
theory. Fineman & Carlson (53) embedded the test lines in a texture gradient of 
dots à la Gibson; the effect did not differ from what was obtained with a texture- 
less background—evidence against misapplied-constancy scaling. Leibowitz & 
Pick (109) compared Ugandan villagers with college students from America, 
Guam, and Uganda. The villagers showed no Ponzo effect under any circum- 
stances, while the others produced increasing errors as depth cues were introduced 
into the stimulus display. A contour-interaction theory cannot explain why the
Ponzo display was totally ineffective among villagers. 


Theory Girgus & Coren (64) have been exploiting short-term error decrement as 
a theoretical tool. Through practice or mere inspection over a period of minutes 
many pattern anomalies show a decrease in error magnitude. These patterns, they 
assert, must have a judgmental (cognitive) component. Stadler (186) effectively 
criticizes Ganz’s equating of figural aftereffects and simultaneous effects. Walker 
(208) has laid it on the line with an explicit contour-interaction model based upon 
spurious excitation in the vicinity of intersecting contours at the retinal level. The 
first return volley is on its way (MacLeod, Virsu & Carpenter 121). This theory was 
designed for intersecting-line displays like the Poggendorff. Assimilation theory
(Pressey & Sweeney 161) and cognitive mistracking (Weintraub & Tong 216) also 
claim to explain the Poggendorff display. 

What sorts of theory are tenable? My biases are that most pattern anomalies are
multiply caused, and that all anomalies do not share the same set of factors. Can 
any theories be excluded? Judging by the amount of negative evidence, misapplied-
constancy scaling should be playing only a minor role in most anomalies. Other
types of context effects, perhaps acquired as the child interacts with his environ- 
ment, may be involved, but I predict that for the pattern anomalies considered 
above (but not for all pattern anomalies), the sine qua non is some form of contour 
interaction, one probably very little affected by learning. The task is to determine 
the nature and locus (central? peripheral?) of the contour effect or effects. Per- 
haps one or more of the above theoretical formulations is close to the truth. 


COLOR Blending phenomenal description and experimental observation is Beck’s 
book on surface color (13). As an example of the flavor of the book, the following 
terms describe subjectively different aspects of the luminousness of light: bright- 
ness, lightness, perceived intensity of illumination, apparent luminance. Plastic 
soupspoons over the eyes provide cheap, convenient, and effective Ganzfeld 
conditions. Crovitz (35) studied binocular color rivalry by painting one soupspoon 
red and one green. Observers tended to see a field divided down the middle into 
two colors with the nasal part of each field being suppressed. Loomis (118) 
eliminated the photopigment-bleaching hypothesis for the Bidwell subjective- 
color phenomenon. A red interval sandwiched between a black and a white interval 
presented in a repeating sequence appears as a flickering blue-green hue. Loomis 
compared the afterimage of the Bidwell color with an adjacent afterimage formed 
by a reordering of the same three light intervals, which appeared red. He showed 
that the two afterimages were not identical (but instead were complementary to
their respective perceived hues) even though the retinal bleaches were identical. 
Kitterle (103) found that simultaneous lightness contrast was a function of the 
orientation of the test bar, and argued that the contrast effect could not, therefore, 
take place at the retinal level. 

Drawing on a huge resource of published data, Bornstein (21) has proposed a 
provocative physiologically based theory to explain cultural differences in color 
naming. The hypothesis states that, as a result of diet and/or environmental
adaptation to the ultraviolet components of sunlight, vision can become short- 
wavelength insensitive. Thus blues and greens, colors in which short wavelengths 
predominate, would tend to be confused and, therefore, be lumped into the same 
verbal color category. 


SIZE AND DISTANCE An agreed-upon terminology is a must in order to avoid 
confusion. Four levels of discourse will be illustrated with size terms: the stimulus
as it exists in the physical world, that is, the distal stimulus (e.g. physical size);
unobservable variables relating to the world of experience (perceived size); response
variables (judged size); and the proximal stimulus (retinal size), the physical
representation of a particular cue at the retinal level. (In my opinion, retinal size, 
since it is not represented directly in experience, has no necessarily preferred status 
in any function relating perceived size to perceived distance.) 

There are various means of judging size. A null-match experiment might in- 
volve an adjustable comparison stimulus at a fixed physical distance that is to be 
matched with other stimuli differing in physical size and physical distance. Extreme
caution is necessary in inferring perceived size from various measures of
judged size. Under what circumstances might a theorist decide that the setting of 
the comparison stimulus measures perceived size? Perhaps when the comparison 
stimulus is reasonably close to the observer, and all distance cues are available, 
then the argument could be made that the comparison stimulus, since it is per- 
ceived veridically in some sense, measures the perceived sizes of other stimuli. 

Although there is good reason to believe that retinal size is not directly regis- 
tered in experience, its physical equivalent, visual angle, would seem to have a 
perceptual equivalent. Under reduced cue conditions the perceptual equivalents of 
physical size, physical distance, and physical visual angle were shown by Foley (55) 
to characterize a perceptual space that could not be Euclidean. A stimulus ob- 
served in the periphery along a horizontal meridian must be moved closer to 
appear equal to the same stimulus judged foveally (Newsome 149). The outcome 
might be interpreted in terms of a change in perceived visual angle rather than the 
experimenter’s interpretation of change in perceived size. Moon illusioners take 
note: If the distortion, which is large, also takes place along the vertical retinal 
meridian, then objects higher in the visual field will be perceived as smaller. 
[Incidentally, vestibular stimulation induced by acceleration in a parallel swing 
had no effect on a luminous-disk equivalent of the moon illusion, according to Van 
Eyl (202)]. 

Observer cues (convergence, accommodation, binocular disparity, and voluntary
eye movements) were investigated with respect to size and distance (56, 110,
151). In (110) accommodation was determined in unintrusive fashion by asking 
the subject to detect the motion of speckle in his visual field produced by a laser- 
scintillation technique. When an observer wears glasses that alter accommodation 
and convergence, Wallach and co-workers (212, 213) have discovered alterations 
in judged size, stereoscopic depth, and pointing with a previously unviewed arm, 
implying that changes in perceived distance were involved. It is easy to train
observers to rotate their heads while viewing a stimulus monocularly in order to 
create motion parallax for themselves. Observers make very accurate judgments of 
depth after ten practice trials (Ferris 51). Johansson (96) also demonstrated the 
efficacy of voluntary head movements in judging depth. 

Gogel (66, 69) has continued his systematic experimental and theoretical attack 
on size-distance problems; back-to-back papers (67, 68) summarize his position. 
Two tendencies are postulated. The specific-distance tendency says that an object 
will tend toward being perceived at a near distance (about 2 meters) from the 
observer as cues to physical distance degenerate. The equidistance tendency says 
that objects will tend toward being perceived at the same distance from the 
observer as cues to physical distance degenerate. Thus a perceived distance is 
associated with every object even under the most severe reduction conditions. 
Gogel proposes an hypothesis about perceptual interactions. The proximal cues 
for physical distance do not alone specify perceived distance (see the two ten- 
dencies described above). One can determine what Gogel calls a perceptual 
equation describing the interaction between perceived size and perceived dis- 
tance. The equation is not truly a perceptual equation because nonperceptual 
proximal stimuli are also represented. A completely perceptual equation is a 
mathematical representation of perceptual space. 
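
The mixed character of such an equation can be seen in a small sketch. It assumes the familiar size-distance invariance form, in which perceived size equals perceived distance scaled by the tangent of the visual angle; whether this is the exact equation Gogel intends is an assumption here, and the function name and values are hypothetical. The visual angle is the nonperceptual (proximal) term.

```python
import math


def perceived_size(perceived_distance_m, visual_angle_deg):
    """Perceived size implied by a perceived distance and a proximal visual angle.

    Assumed form (size-distance invariance): S' = 2 * D' * tan(theta / 2),
    where S' and D' are perceptual and theta is the proximal visual angle.
    """
    half_angle = math.radians(visual_angle_deg) / 2.0
    return 2.0 * perceived_distance_m * math.tan(half_angle)


# Under severe cue reduction the specific-distance tendency places an object
# at about 2 m, so a 1-degree disk would be assigned roughly the same
# perceived size whatever its physical distance.
print(perceived_size(2.0, 1.0))
```

Replacing the visual angle with its perceptual equivalent would turn this into what the text calls a completely perceptual equation.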

Over the years a comprehensive set of proximal cues has been assembled and a 
beginning made of specifying interactions among subsets. Needed badly are 
overlying principles that encompass many specific cues-to-percept relationships. 
Gogel’s approach is to be commended. (I feel that his theoretical principles are 
aimed in the proper direction.) 

Two interesting papers considered shape constancy for slanted plane stimuli.
Perkins (155) presented outline drawings of boxes, some of which could not be
projections of rectangular boxes, and asked subjects to state whether each box was
rectangular. Even at a severe slant, subjects tended to respond to the outline, not in 
terms of its retinal projection but in terms of the outline on the card. Massaro (127) 
measured reaction times and errors in classifying a stimulus as circular or ellip- 
tical. Reaction times for correct judgments and percent errors increased with in- 
creasing slant. 

Gibson (62) has re-elaborated his theoretical position; Gyr (78) marshalled 
cogent criticisms; Gibson responded (63). Parenthetically, texture may be over- 
rated as a cue to depth (Newman 148). 

The Pulfrich stereophenomenon (viewing pendular oscillation binocularly with 
a neutral-density filter over one eye leads to perceived orbital motion) continues to 
attract attention. Harker (80) has incorporated it into a testing procedure as one of 
the indicators of binocular vision with clinical use in mind. Levick, Cleland & 
Coombs (113) lay out the calculations to show that the expected orbit of pendular 
motion is a conic section, assuming the simple explanation of the phenomenon to 
be time lag between eyes for signals reaching the brain. New data are provided 
by Rogers & Anstis (170). 
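
The time-lag explanation can be sketched numerically. The code below is a simplified illustration, not the calculation of Levick, Cleland & Coombs (113): it works in position and disparity coordinates rather than in real space, and the amplitude, frequency, and lag values are hypothetical. It samples a sinusoidal pendulum as seen by an undelayed eye and by a delayed eye, then checks that the resulting (mean position, disparity) pairs fall on an ellipse, which is one of the conic sections.

```python
import math


def pulfrich_samples(amplitude=1.0, omega=2.0 * math.pi, lag=0.01, n=200):
    """Return (mean_position, disparity) pairs over one pendulum cycle.

    The filtered eye is assumed to see the bob where it was `lag` seconds ago.
    """
    period = 2.0 * math.pi / omega
    samples = []
    for k in range(n):
        t = k / n * period
        undelayed = amplitude * math.sin(omega * t)
        delayed = amplitude * math.sin(omega * (t - lag))
        samples.append(((undelayed + delayed) / 2.0, undelayed - delayed))
    return samples


def ellipse_residual(mean_position, disparity, amplitude, omega, lag):
    """Deviation from the ellipse (m/a)^2 + (d/b)^2 = 1 predicted by a pure lag."""
    half_phase = omega * lag / 2.0
    a = amplitude * math.cos(half_phase)        # semi-axis along position
    b = 2.0 * amplitude * math.sin(half_phase)  # semi-axis along disparity
    return (mean_position / a) ** 2 + (disparity / b) ** 2 - 1.0


worst = max(
    abs(ellipse_residual(m, d, 1.0, 2.0 * math.pi, 0.01))
    for m, d in pulfrich_samples()
)
print(worst)  # numerically negligible: the samples lie on the ellipse
```

Disparity is largest at mid-swing, where the bob moves fastest, and changes sign with the direction of travel, which is why the bob appears to orbit in depth.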

And finally, a review by Miller (140) of cross-cultural research on the perception of
pictorial materials, its perils and pitfalls, is highly recommended for those inter- 
ested in supporting theories of depth perception with cross-cultural data. 


CONTEXT 


Regardless of whether one believes with Gibson that perceptual experience leads 
the perceiver toward veridicality, there is no question that a variety of stimulus 
factors interfere with the achievement of veridicality. When stimuli from a set are 
presented one at a time to be identified by observers—the method of single stimuli 
or absolute judgments—the data bear strong resemblances to the outcomes of 
paired-associate learning experiments. Much as in the learning experiment, sub- 
jects are first shown how to assign a number to each stimulus; the stimuli are then 
presented in a random order and the correct number is called for. The outcomes 
are influenced by many of the variables known to influence learning (see Siegel & 
Siegel 177). In a pitch-discrimination task clear sequential dependencies are re- 
vealed for trials on which the just-previous stimulus is repeated (Siegel 179, 
confirmed by Aiken, Abrams & Shennum 2). Lockhead (116), in his latest sum- 
mary article, also documents memory effects as a function of sequential depen- 
dencies. The conclusion is that a large portion of the limitation on performance 
may be a memory limitation on remembering the assignment of responses to 
stimuli. The subject will tend to repeat his previous response on repetition trials 
with no feedback independently of whether he was previously correct or incorrect 
(179, but not confirmed by 2). 

Another documented tendency is that an observer reveals a propensity to 
employ all the available response categories equally often. Steinberg & Sekuler 
(190) obtained confirming evidence using category judgments. 

The range of stimuli being judged also strongly influences judgments. Parducci 
(152) developed a range-frequency model that is the prototype for the more recent 
formulations. Gravetter & Lockhead (70) presented a similar model, which, like 
Parducci’s, is in the tradition of Thurstonian category scaling. 
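
A minimal sketch of the range-frequency idea follows. It uses the commonly cited form in which a judgment is a weighted average of a range value (position between the extremes of the set) and a frequency value (rank within the set). The equal weighting, the 0-to-1 judgment scale, and the function name are assumptions made for illustration, not details taken from Parducci (152).

```python
def range_frequency_judgments(stimuli, weight=0.5):
    """Map each distinct stimulus value to a predicted judgment between 0 and 1.

    weight is the assumed share given to the range principle; the remainder
    goes to the frequency principle.
    """
    low, high = min(stimuli), max(stimuli)
    if low == high:
        raise ValueError("need at least two distinct stimulus values")
    ordered = sorted(stimuli)
    n = len(ordered)
    judgments = {}
    for s in set(stimuli):
        range_value = (s - low) / (high - low)
        # Tied presentations share the mean of their ranks.
        ranks = [i for i, v in enumerate(ordered) if v == s]
        frequency_value = (sum(ranks) / len(ranks)) / (n - 1)
        judgments[s] = weight * range_value + (1.0 - weight) * frequency_value
    return judgments


# In a positively skewed set the value 4 is judged well above its range position.
skewed = range_frequency_judgments([1, 2, 3, 4, 10])
print(skewed[4])
```

In the skewed set the stimulus 4 sits one third of the way along the range but three quarters of the way up the ranks, so the predicted judgment falls between the two.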

In contrast to the above approach to context, Helson’s adaptation-level theory 
has tended to accept judgments as valid indicators of the perceived stimulus. 
Anchor stimuli do shift judgments [Ellis, with comparative judgments to avoid 
response bias (47); Steger, Wilkinson & Carter (189)]. The shift can be moderated 
by telling subjects after practice that they are excellent judgers and to keep up the 
good work (Levine 114). In principle, forced-choice techniques and Thurstonian 
scaling (and, therefore, signal-detection analysis) can eliminate or separate re- 
sponse bias (response-criterion effects) from perceiving (discriminability) (see 
McNicol & Pennington 134). I predict that the pioneering adaptation-level for- 
mulation will be absorbed by the approaches capable of treating the effects of 
context in more sophisticated ways. Incidentally, I am convinced that no possible 
appeal to response bias can explain away the differences in hue of Helson’s 
original work on adaptation level in which nonselective stimuli (grays under normal
illumination) can appear to have any of a wide range of hues (red through
achromatic through green) as a function of reflectance relative to the adapting
reflectance of long wavelength (red) light (Helson 86). What I am suggesting is that 
theories attempting to explain adaptation-level effects in terms of response bias 
must be specific about the kinds of stimuli to which the explanation applies. I 
believe that adaptation-level theory itself went awry when it became a general 
theory of context effects. 
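
The separation of response bias from discriminability mentioned above can be sketched with the standard equal-variance Gaussian signal-detection indices. The example is illustrative only: the two hypothetical observers and their parameter values are assumptions, and the helper names are not drawn from any of the cited papers.

```python
from statistics import NormalDist

_STANDARD_NORMAL = NormalDist()


def rates_from(d_prime, criterion):
    """Hit and false-alarm rates implied by an equal-variance Gaussian model."""
    hit_rate = _STANDARD_NORMAL.cdf(d_prime / 2.0 - criterion)
    false_alarm_rate = _STANDARD_NORMAL.cdf(-d_prime / 2.0 - criterion)
    return hit_rate, false_alarm_rate


def sdt_indices(hit_rate, false_alarm_rate):
    """Return (d_prime, criterion); both rates must lie strictly between 0 and 1."""
    z_hit = _STANDARD_NORMAL.inv_cdf(hit_rate)
    z_fa = _STANDARD_NORMAL.inv_cdf(false_alarm_rate)
    d_prime = z_hit - z_fa               # discriminability
    criterion = -0.5 * (z_hit + z_fa)    # response bias
    return d_prime, criterion


# Two hypothetical observers with equal sensitivity but different bias:
# their raw hit rates differ, yet the recovered d' is the same.
neutral = rates_from(1.0, 0.0)
cautious = rates_from(1.0, 0.5)
print(neutral, sdt_indices(*neutral))
print(cautious, sdt_indices(*cautious))
```

A shift produced by an anchor stimulus that moved only the criterion would leave d' untouched in this analysis, whereas a genuine perceptual change would alter d'.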

Context can be provided by knowledge of the characteristics of an object. 
Borresen (22) found that stimuli with characteristics that implied motion in a 
given direction (e.g. an automobile) led to a significant number of autokinetic 
movements in the proper direction. Biederman and co-workers (15) scrambled the 
context by sectioning a picture into blocks and rearranging them. Search time for 
an object was the dependent variable. Jonides & Gleitman (99) obtained reac- 
tion-time differences depending upon whether the subject was searching for an 
item in the same category or one in a different category from the rest of the items in 
the array. The symbol 0 is ambiguous until conceptually categorized by the subject 
as either a letter or digit. The symbol was processed during search as though it were 
a member of the set to which it was assigned conceptually.

The Stroop Color Word Test contains interference trials on which subjects are 
asked to name the hue of the ink in which each word is printed while ignoring the 
word, which spells an incongruent color name. Dyer (44) summarized and inte- 
grated the recent experimental literature. There is strong consensus that response 
competition is involved. If so, then the next step is to explain response competi- 
tion. To response competition Dyer adds a second factor, a breakdown in selective 
attention, to be specific, an inability to focus on a color and at the same time gate 
the word itself. Dyer & Severance (45) have tested a technique useful in Stroop- 
like situations involving pairs of dimensions that cannot be combined spatially, or 
useful in studying the time course of the interference. The two dimensions can be 
presented sequentially. When a color patch is preceded by the flashing of a color 
name in black, a reliable Stroop effect is obtained, but it is smaller by about half. 
The following references were published during the time period covered by this 
review but not listed in Dyer’s review (43, 58, 90, 115, 157, 173). 

Not all word contexts are detrimental as in the Stroop test. A word-superiority 
effect was obtained by Reicher (165). A letter can be identified more easily when it 
is embedded in a word than when it is presented alone or in a sequence of unrelated 
letters. The finding is counterintuitive. How can an observer gain more informa- 
tion about each letter in a four-letter word than about a single letter in a one-letter 
array? The phenomenon has, therefore, attracted considerable attention. Bouma 
(23), in an investigation directed at the process of reading, found that recognition 
declines with distance from the fovea (less so toward the right). For a letter of a 
given eccentricity, i.e. distance from the fovea, recognition is better for that letter if 
the rest of the word lies closer to the fovea. Thus, for a letter to the right of fixation, 
if that letter begins a word, then it is recognized less well than if it remains in the 
same location and ends a word. The opposite is true to the left of fixation. A 
reasonable conclusion is that some type of lateral masking among adjacent stimuli 
acts more strongly in the direction of the fovea than against it. (Both words and 
nonwords were used as stimuli. Stimuli were presented for 100 milliseconds, and
no mask was used.) The Taylor & Brown (194) U-shaped recognition function, 
found to the right of fixation with unlimited viewing, seems to be accounted for by 
Bouma’s two factors. Compare the data of Monti (144) also. When Shaw & Weigel 
(175) replaced letters in a string with bars or blanks, recognition was facilitated. If 
a string of letters is fixated at its center, then the lateral masking of adjacent 
contours diminishes as one proceeds toward the initial and final letters of the 
string. However, this is more than counteracted by the general decline in recog-
nition toward the periphery.

What is the effect of adding a mask? The ends of a string seem less affected by the
mask. Is this because the ends are processed first and masking has its greatest 
impact on the remainder of the string? Henderson & Park (87), through the use of 
monetary payoffs favoring the middle, still found that a pattern mask most inter- 
feres with the middle of a string. Johnston & McClelland (98) reported that the 
word-superiority effect depended critically on the type of postexposure field. A 
pattern mask produces the effect; a plain white field as mask abolishes it. 

What cognitive factors are involved? Mezrich (138) hypothesized that a single 
letter was processed as a spatial pattern rather than a verbal code presumed for 
words. Mezrich required subjects to vocalize each stimulus, letter or word, to make 
processing more similar. The result was a letter-superiority effect. Redundancy 
should be important in these types of situations and it is (see Colegate & Eriksen 
30). On the question of controlling for redundancy the data of Thompson &
Massaro (196) conflict with those of Smith & Haviland (183). In what is the latest
word, but surely not the last word, Massaro (126) demonstrated that what he called
lateral masking, but what may well be the laterality effects of pattern masking, is 
the culprit. Controlling both for redundancy and laterality effects reduced the 
word-superiority effect to near zero. 

Repeatedly flashing the same visual stimulus improves its identification (Haber 
& Hershenson 79), but doubt has been cast upon Haber’s hypothesis that repeated
presentations improve the clarity of the perceived stimulus (89, 215). Doherty & 
Keeley (42) marshalled the past evidence against the hypothesis that clarity en- 
hancement is the cause of the well-documented improvement in identifiability. 
They proposed instead that the evidence strongly favors the hypothesis that 
independent perceptual inputs are aggregated from each succeeding presentation 
of the stimulus. 

The rod-and-frame test (RFT) measures the distortion of the perceived
verticality of a luminous rod that results from a tilted luminous square frame. The
RFT is used as a tool in personality research. Since the phenomenon is not well
understood, it is fortunate that psychophysical work is also proceeding. Increased 
exposure time increases RFT errors (84); the most obvious hypothesis is adapta- 
tion to a tilted visual field. Long (117) found that low illumination affected RFT 
scores, and series of trials in which the experimenter moved the rod as soon as the 
subject responded gave different data from a predetermined experimenter pace. 
Small (181) verbally reinforced subjects either toward or away from true verti-
cality by saying “very good” at opportune times. The resulting perturbations in
judgments remained on a follow-up test one month later. If short-term condi-
tioning procedures have long-term consequences, then what is the usefulness of
the RFT for predicting presumably stable personality structures (field dependence 
and independence)? Psychiatric inpatients often do not understand instructions for 
the RFT (Johnson, Neville & Beutler 97), which partly explained the extreme field 
dependence found consistently among their own inpatients in the past. Walker 
(211) found no significant correlation between field dependence measured visually 
and tactually. Instead of the square frame, an open grillwork was used, which the 
subject either looked through or reached through to determine verticality. 


PERCEPTUAL MECHANISMS 


Stimulating the visual sense requires not only sufficient electromagnetic energy in 
the proper wavelength range delivered to the retina but also a temporal change in 
stimulus energy. Temporal modulation must occur on the retina via eye move- 
ments or motion of the stimulus. Stabilized-image work has shown that not only 
homogeneous fields but also patterned fields fade and disappear perceptually in 
seconds. If stabilization is complete, then nothing reappears. Tulunay-Keesey 
(199) contributed additional key evidence. In brief, slow flicker is the main de- 
terminer of target visibility under stabilized-image conditions. Arend (5) has 
begun the important work of incorporating temporal energy change into a model 
of the visual system that must, of course, be responsive to the spatial distributions 
of energy signaling contour (and thus pattern), brightness, hue, etc. 


FEATURE DETECTORS Sekuler’s (172) review emphasized feature detection; in- 
terest and research in the area has expanded rapidly, ranking with pattern ano- 
malies (illusions) in appeal. Research centers around two basic adaptation proce- 
dures. 

The simple aftereffect paradigm requires one adapting stimulus to which the 
observer attends for a specified time. Normally the observer scans the adapting 
stimulus to prevent the formation of a conventional afterimage. Stimulus dimen- 
sions most often investigated are type of pattern (usually either a line or a repeat- 
ing pattern of parallel elements—square-wave or sinusoidal grating), orientation 
(i.e. tilt), motion, and adapting contrast. The test stimulus is modified in some way 
as a consequence of adaptation. Its detection threshold is increased, supra- 
threshold contrast is reduced, perceived tilt or frequency or velocity is changed. 
Blakemore and co-workers (18) have been exploiting a derived measure devel- 
oped by Blakemore & Nachmias (19), specifying any threshold elevation, how- 
ever it is produced, in terms of equivalent contrast, i.e. the contrast of a vertical 
adapting grating that produces the same amount of threshold elevation (on a 
vertical test grating). Movshon & Blakemore (145) showed, for example, that the 
change in orientation of an adapting grating that halves the equivalent contrast is 
reasonably constant regardless of spatial frequency. The untransformed outcome, 
the range of orientations adapted (i.e. how many degrees on either side of vertical 
were adapted by a vertical grating), is heavily dependent on the particular spatial
adapting frequency employed. Sharpe & Tolhurst (174) used the equivalent-con-
trast transformation to evaluate effects of grating drift and flicker.

With respect to the specificity of neural units, there are those willing to argue 
on the basis of psychophysical data in favor of velocity-specific detectors (Tol- 
hurst, Sharpe & Hart 198) and narrowly tuned spatial-frequency analyzers (Abadi 
& Kulikowski 1). Blakemore, Muncey & Ridley (18) discussed the possibility that 
spatial-frequency channels are relatively broad, but they inhibit each other (Tol- 
hurst 197), thus sharpening the tuning of the system. “If this is the case, then 
aftereffects of adaptation may be due to prolonged inhibition rather than to 
over-excitation. ... In that case, adaptation experiments (and those involving 
simultaneous masking) may be measuring the broad tuning properties of this
inhibition, rather than the excitatory tuning characteristics themselves” (18, p. 
1929). 

Two important articles raise problems for the concept of spatial-frequency 
analyzers. Stecher, Sigel & Lange (188) worked with grating adaptation among 
frequencies so close as to be considered within the same spatial-frequency chan- 
nel. As one example, a frequency of 8 cycles per degree was added to an adapting 
frequency of 13.3 cycles. The combination not only did not increase the adapting 
effect on a test grating of 13.3 cycles, it actually reduced it. That is, adding an 
adjacent spatial frequency weakened the potency of the adapting stimulus. Nach- 
mias et al (147) presented a square-wave adapting stimulus, which has a Fourier 
spectrum of all the odd harmonics, and checked for a threshold increase in the 
vicinity of the third harmonic. Under temporal two-alternative forced-choice 
conditions, adaptation was not obtained. Partial success was achieved with the
method of adjustment. 
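
The harmonic structure behind the Nachmias et al test can be made concrete with a short numerical sketch. It assumes an ideal square-wave grating of unit amplitude and 50% duty cycle, whose Fourier series contains only odd harmonics with amplitude 4/(πn). The function name and the fundamental of 3 cycles per degree are illustrative choices, not values taken from the study.

```python
import math

def square_wave_harmonics(fundamental_cpd, n_terms=4):
    """Return (frequency, amplitude) pairs for the first odd harmonics
    of an ideal unit-amplitude square wave (assumed 50% duty cycle)."""
    pairs = []
    for k in range(n_terms):
        n = 2 * k + 1                      # odd harmonic number: 1, 3, 5, ...
        amplitude = 4.0 / (math.pi * n)    # Fourier amplitude of harmonic n
        pairs.append((n * fundamental_cpd, amplitude))
    return pairs

# Hypothetical fundamental of 3 cycles per degree (illustrative only).
harmonics = square_wave_harmonics(3.0)
for freq, amp in harmonics:
    print(f"{freq:5.1f} c/deg  amplitude {amp:.3f}")
```

In this idealization the third harmonic has one third the amplitude of the fundamental, so any adaptation in its vicinity would be produced by a comparatively weak component of the adapting stimulus.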

Ocular defects in the newborn lead to selective deprivation of certain line 
orientations. The most recent selective-deprivation work with kittens continues to 
support the view that permanent acuity deficits occur (146), and that anomalies 
are found at the cortical level (17, 156). In humans loss of acuity can be demon- 
strated for orientations affected by astigmatism, and the severity depends upon the 
degree of astigmatism (143). In conjunction with the reduced resolution as a 
function of astigmatism, Freeman & Thibos (57) noted a concomitant decrease in 
evoked potentials. Annis & Frost (3) claim that different visual environments 
provide different opportunities for observing obliques compared to horizontals 
and verticals, and these experiences are mirrored in visual-resolution tests using 
gratings in various orientations. In addition to visual acuity, it has been well 
established that targets presented either horizontally or vertically show a consis- 
tent, though often small, superiority in performance over oblique orientations for 
visual tasks like stimulus detection, tracking, and judgments of orientation (Ap- 
pelle 4). Are all such meridional effects determined by early stimulation? Al- 
though gravitational cues appear to maintain orientation constancy (195), two 
studies (52, 142) showed that adaptation to gratings was dependent upon the 
retinal coordinates of the gratings. Vestibular input as a consequence of head tilt 
did not affect the adaptation process. Thus far, with counterevidence lacking, the
message is unambiguous: Astigmatism and, by implication, any ocular defect that
blurs or distorts the retinal image, seem to lead to permanent visual dysfunction. 

The second basic type of experiment is the contingency-aftereffects paradigm
pioneered by McCollough (131) and her students. Two values are chosen from
each of two stimulus dimensions (e.g. hue, either orange or blue; bar grating
oriented vertically or horizontally). Pairing one value from each dimension pro-
vides two adapting stimuli (orange verticals, blue horizontals), which are pre-
sented alternately. The test stimulus probes with one dimension to check
whether the other is elicited. For the McCollough effect a black and white stimulus
of horizontal and vertical bars produces complementary hues contingent upon bar
orientation.

Useful variations have been developed. Lovegrove & Over (119) linked two 
dimensions in one adapting stimulus (hue and grating orientation) and obtained a 
greater tilt aftereffect when the test grating matched the adapting grating in hue. 
Virsu & Haapasalo (203) alternated square-wave gratings of different frequencies, 
each paired with a different hue, and after adaptation, presenting a hue caused an 
intermediate frequency to shift away from the spatial frequency paired with that 
hue. 

Successful contingency pairings permit the inference of various types of 
higher-order neural cells. The following have been successfully paired: hue with 
curved lines (167), hue with rotation clockwise and counterclockwise (129), fine 
and coarse texture with rotation (209), hue with motion up or down (120), rotation 
with direction of gaze left or right (128). Hay & Goldsmith (83) linked head 
movements up or down with movements of a spot right or left. Woolsey & 
Newman (222) presented contrarotating disks on the same shaft. Each disk had 
transparent and dotted sectors, and subjects observed simultaneous contrarota- 
tion, one disk seen through the other. A rotation aftereffect was linked with each 
disk when rotation ceased (rotation had been paired with depth?). Stromeyer, 
Lange & Ganz (191) paired a different hue with each of two vertical gratings hav- 
ing identical frequency spectra but differing in phase angle between frequency com- 
ponents. Not all sinusoid sets were effective in producing a hue-contingent effect. 

Over, Long & Lovegrove (150) could not generate a McCollough effect with
contour to one eye and pattern to the other; Coltheart (31) obtained no interocular 
transfer. Therefore, hue and orientation may be linked somewhat peripherally. 
Mayhew (130) showed how to produce movement aftereffects contingent on size.
The effect will not transfer to a black and white reversal of the adapting pattern; it 
is short-lived, one to two seconds, but can be re-elicited many times. 

No doubt even more exotic contingencies are just around the corner. Is it 
logical to argue that every successful pairing demonstrates the existence of ana- 
lyzers for each dimension? 

Other methods of approaching feature detection are via afterimages (Atkinson 
6, Wade 205), binocular rivalry (Wade 204), and even monocular rivalry (7, 26). 
For two papers addressed to feature detection from a more cognitive point of view 
see Weisstein, Montalvo & Ozog (219) and Harmon & Julesz (81).

The model of spatial-frequency analyzers transmitting information along
separate pathways can be taken literally. The work of Maffei & Fiorentini (122), 
for example, recording the responses of single neurons at various levels of the 
visual system of the cat, would indicate that a visual system is capable of coding 
and transmitting information in that manner. However, when a higher-order cell 
is found that responds to a limited set of stimulus features or a narrow frequency 
band, does that measurable neural concomitant of retinal stimulation have any- 
thing to do with the process of perception? Uttal’s (201) distinction between sign 
and code confronts the matter explicitly. A sign is a neural concomitant of stimu- 
lation that is lost at some more central level of the nervous system. A code, on the 
other hand, is a true conveyor of stimulus information; it must be both necessary 
and sufficient for producing a sensory experience. 

The neurophysiological theories of the future will address themselves to 
higher-order functions like perceived depth, and ultimately to the constancies 
(size, distance, brightness, etc). The perception of depth from stereoscopic cues 
requires the existence of disparity detectors. Three papers presented psychological 
evidence buttressed by arguments favoring the existence of such detectors (141, 
164, 166). With respect to the constancies, a variety of papers presented data that 
could not be accounted for by appealing to the retinal distance between contours. 
Instead, perceived distance including perceived separation in depth explained the 
outcomes, as though neural interaction were taking place at the level of distance 
constancy. Mershon (136) investigated whiteness contrast, Greene, Lawson & 
Godek (71) the Ponzo effect, and Attneave & Block (8) apparent motion. 
Blakemore, Garner & Sweet (16) found interocular transfer of grating-frequency 
adaptation, but did not find a size-constancy effect for gratings, i.e. when the test 
grating was the same as the adapting grating but at a different distance. 
Supporting, but perhaps rather speculative, neurophysiological evidence also ex- 
ists (124, 185). 


BACKWARD MASKING Lefton (112) reported that in metacontrast the duration of 
the mask had little effect, as did mask width (compare with 100), and that 
maximum masking occurred when the mask was presented at the offset of the 
stimulus (compare with 75). Dember & Stefl (41) with various black-and-white 
sectored test stimuli obtained detection enhancement for the metacontrast para- 
digm compared to detection in the absence of the mask. Standing & Dodwell 
(187) flashed a subthreshold stimulus upon a steady background, and if the 
background terminated shortly after the offset of the subthreshold stimulus, 
then the flash became visible. Lefton (111) reviewed the field of metacontrast. 


PRISMATIC ADAPTATION Research on touch/vision coordination and rearrange- 
ment seems to be undergoing a period of consolidation. Nonfocal factors have 
been emphasized, and at present it is difficult to provide a synthesis or integration 
of the findings. With spaced practice under prismatic viewing conditions, consid- 
erable transfer of adaptation to the unexposed hand occurs, but not under massed 
practice (Taub & Goldberg 193, but compare with Cohen 29). The goggles
holding the prisms are an important component of the stimulus context, and after
adaptation, goggles alone contribute to the effects (Kravitz 106). The conditioned 
stimulus for adaptation effects need not be a relevant aspect of adaptation. A 
concomitant tone during adaptation also acquired the properties of a conditioned 
stimulus (107). In an important experiment, Welch (220) employed a pseudopris- 
matic situation in which the observer viewed the luminous finger of another 
person that was displaced as though the observer were viewing his own finger 
through prisms. Prismlike adaptation occurred even for informed observers who 
knew that the luminous adapting finger was not their own. Walker (210) demon- 
strated the natural visual capture that occurs with a size anomaly in the visual field. 
Cognitive factors are important (139, 200). Unnoticed head deviations systemati- 
cally occur (108). Passive arm movements led to adaptation when the arm re- 
mained outstretched (Baily 9); passive adaptation depended on the use of a 
structured visual background (Melamed, Halay & Gildow 135). But perhaps we 
are too enraptured by visual capture (McDonnell & Duffett 132). 


APPLICATIONS 


A report on the value of the stereoscopic viewing of aerial photographs by 
Zeidner, Sadacca & Schwartz (224) is a neglected classic. Experienced photoin- 
terpreters, using aerial photographs obtained on actual missions, were asked to 
detect and identify objects of military significance. (No depth information is 
necessarily required for such a task.) Contrary to the prejudices of even today’s 
image interpreters, there was no evidence favoring stereo over nonstereo proce- 
dures (which employed the same photographs with one of the stereo pair removed, 
and a plausible cover story concerning the absence of stereo) on the task. Some 
measures showed a slight superiority for nonstereo. 

Facial recognition, in light of its importance in eyewitness testimony in court, is 
a worthy topic. Shepherd & Ellis (176) found that for a set of female faces mem- 
ory was excellent for those considered as either beautiful or ugly, but not as good 
for the remainder. A cross-racial bias has been shown favoring remembering the 
faces of members of one’s own race (123). However, a short-term training session 
of 100 trials on white subjects obliterated the bias. Galper (59) found the usual 
cross-racial bias except that white students enrolled in a Black studies course 
showed better recognition for black faces. 

Reading is an activity with perceptual components. The danger for those in 
perception is to treat it as a purely perceptual phenomenon. A collection of papers 
on the relationship between speech and reading (101) contains excellent selections 
to titillate theorist and experimentalist. Frank Smith (184) has proposed an 
information-processing/psycholinguistic/cognitive (George Miller/Chomsky/ 
Bruner) theory of reading, which is presented in sprightly and persuasive 
style in a little book called Understanding Reading. I have used the book as a 
text in an undergraduate perception course and find that a great deal of the 
modern data and theories of perception can be integrated into a topic that appeals 
to many students. Baron (10) published evidence that supports Smith’s stance
quite nicely. Baron argued from his data that a reader does not need an interme-
diate phonemic code to read. The reader can instead proceed directly to meaning
from a visual analysis. Guthrie (77) used children with a reading disability to 
investigate two classes of reading models. Two groups of normal children served as 
control subjects, one matching the disabled children for age, and the other, much 
younger, matching for level of reading ability. 


METHODOLOGY 


The word is out. Beware of response bias. Consider forced-choice procedures as a 
substitute for the less tedious and more informative method of adjustment. There 
is merit in testing crucial data using a forced-choice arrangement as a check [note 
the Nachmias, Sansbury, Vassilev & Weber (147) study]. Even checking a few data 
points with each method is valuable. One owes it to oneself as an informed 
scientist to grasp the basics of signal-detection theory. Try McNicol’s primer (133). 
Swets (192) again argues convincingly about the value of the approach; response 
bias cannot, in fact, be overlooked or disregarded; interpreting data in terms of an 
operating characteristic has superior practical advantages. Simpson & Fitter (180) 
discuss various indices of detection and give reasons for recommending a partic-
ular one, d_a, which is equal to √2 times the normal deviate of the area under the
ROC curve. Consider the advantages of staircase procedures in which the ob- 
server's response determines the stimulus sequencing, keeping the stimuli being 
presented within the observer’s uncertainty region. Simpler versions are an 
efficient variant of the method of constant stimuli. If necessary, the procedure can 
be used with small numbers of observations; statistical computations are easy. [A 
brief discussion and examples of the procedure in action appeared in (214).] The 
staircase method should be compared with Herrick’s (88) random method of 
limits, a constant-stimulus variant that I do not recommend. Without adaptations 
the staircase approach will not guard against response bias. However, null-match 
judgments that are not context free cannot be safeguarded anyway. 
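
The index recommended by Simpson & Fitter can be illustrated with a brief sketch. It takes the definition given above (√2 times the normal deviate of the area under the ROC curve) at face value; the function names and the example areas are hypothetical, and `statistics.NormalDist` requires Python 3.8 or later.

```python
from math import sqrt
from statistics import NormalDist

def d_a_from_area(area):
    """Convert area under the ROC curve (0 < area < 1) into
    sqrt(2) times the corresponding normal deviate."""
    if not 0.0 < area < 1.0:
        raise ValueError("area must lie strictly between 0 and 1")
    return sqrt(2.0) * NormalDist().inv_cdf(area)

def area_from_d_a(d_a):
    """Inverse conversion: recover the ROC area from the index."""
    return NormalDist().cdf(d_a / sqrt(2.0))

# Illustrative areas only; these are not data from any cited study.
for area in (0.50, 0.75, 0.90):
    print(f"area {area:.2f} -> index {d_a_from_area(area):.3f}")
```

An area of 0.5 (chance performance) maps to an index of zero, and the index grows without bound as the area approaches 1.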

Wright (223) used signal detectability procedures to determine the wavelength 
discrimination function of the pigeon. Compare signal detection with pigeons 
(Wright) to the method of constant stimuli with humans (Siegel & Siegel 178); 
both experiments probe wavelength discrimination. [Not everyone is enchanted 
with signal detectability. See Ellis (46), Wagenaar (206), and Hébert (85).] 

A model psychophysical experiment by Kelly & Savoie (102) compared the 
method of adjustment with a modified staircase procedure. A forced-choice de- 
cision was required to choose which of the two temporal intervals the stimulus 
appeared in. Stairstepping to the next level was contingent upon whether the 
subject had chosen the incorrect interval. This paper and the Wright paper cited 
above should be required reading for practicing psychophysicists. Incidentally, the 
data obtained in detecting sinusoidal gratings by the forced-choice staircase 
procedure differ very little in shape from the adjustment data. Overall sensitivity 
was shown to be far better by the staircase method. 
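
The logic of a forced-choice staircase can be sketched in a small simulation. This is a generic illustration, not the specific procedure of Kelly & Savoie: it assumes a 1-up/2-down stepping rule and a simulated observer with a Weibull psychometric function, and every name and parameter value below is hypothetical.

```python
import math
import random

def p_correct(contrast, threshold=0.01, slope=3.0):
    """Assumed Weibull psychometric function for a two-interval task:
    0.5 (guessing) at zero contrast, approaching 1.0 at high contrast."""
    return 0.5 + 0.5 * (1.0 - math.exp(-((contrast / threshold) ** slope)))

def run_staircase(n_trials=400, start=0.05, step=1.12, seed=1):
    """Simulate a hypothetical 1-up/2-down forced-choice staircase.
    Contrast is raised after each error and lowered after two
    consecutive correct choices; returns the contrast on every trial."""
    rng = random.Random(seed)
    contrast, run_of_correct, track = start, 0, []
    for _ in range(n_trials):
        track.append(contrast)
        correct = rng.random() < p_correct(contrast)
        if correct:
            run_of_correct += 1
            if run_of_correct == 2:       # two in a row: make the task harder
                contrast /= step
                run_of_correct = 0
        else:                             # any error: make the task easier
            contrast *= step
            run_of_correct = 0
    return track

track = run_staircase()
# Geometric mean of the second half of the track as a rough threshold estimate.
tail = track[len(track) // 2:]
estimate = math.exp(sum(math.log(c) for c in tail) / len(tail))
print(f"estimated threshold contrast: {estimate:.4f}")
```

Because the observer's choices determine the next stimulus level, most trials fall near the level where performance is changing most rapidly, which is the efficiency argument made for staircase procedures.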

Another way to avoid certain types of response bias is to use an “involuntary”
response. Metz & Balliet (137) assessed human scotopic luminosity functions by
means of optokinetic nystagmus generated with moving stripes. Kleespies & 
Wiener (104) investigated subception with an initial eye-movement response 
(orienting reflex) not normally monitored consciously.


IN CONCLUSION 


Two topics lying outside the traditional lines of perceptual research deserve 
mention because of uniqueness. Harris (82) plotted the log times against log 
distances of current world sports records in swimming, running, walking, speed 
skating, and horse racing. The data are strongly linear for each type of activity and 
slopes are nearly identical. Thus the data conform closely to Stevens’ power law 
with an all-encompassing exponent of 1.1! Harris hazards a perceptual explana-
tion. Hypotheses anyone?
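
The link between a straight line in log-log coordinates and a power-function exponent can be shown with a short sketch. The values below are synthetic, generated from an exact power function with an exponent of 1.1; they are not the sports records analyzed by Harris, and the function name and constants are hypothetical.

```python
import math

def loglog_slope(distances, times):
    """Least-squares slope of log(time) on log(distance); for a power
    function time = k * distance**n the slope equals the exponent n."""
    xs = [math.log10(d) for d in distances]
    ys = [math.log10(t) for t in times]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Synthetic values generated from an exact power function, NOT real records.
distances = [100, 200, 400, 800, 1500, 5000, 10000]
times = [0.06 * d ** 1.1 for d in distances]
slope = loglog_slope(distances, times)
print(f"fitted exponent: {slope:.3f}")
```

With real records the points would scatter about the line, and the fitted slope would estimate the exponent rather than reproduce it exactly.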

Suppose that an intense flash of light forms a long-lasting afterimage of one’s 
own hands being held in front of a grid of lines. Now, in total darkness, one hand 
is lowered “out of the picture” of the afterimage. Suppose instead that the after- 
image of a corridor is produced, and then one physically walks down the corridor 
in the dark. Davies (36, 37) collected subjective reports. A black hole often 
appeared in the afterimage where the hand had been; the sensation of walking 
along the corridor, i.e. through one’s own afterimage, was also reported. 

For me, the phenomenological aspects are an appealing aspect of the study of 
perception. Given that a physical world exists, what is the nature of the pheno- 
menal world? How did it get that way? What can be determined about the neural 
substrate without which there is no perception? Some pieces of the puzzle are 
amenable to scientific investigation. For the investigators involved, I propose the 
title, Experimental Phenomenologist. Much of the puzzle invades the domain of 
philosophy. An essay by Globus (65) argues that monist and dualist views are 
symmetric, and treats the symmetry in terms of Bohr’s complementarity principle. 
Be advised that the arguments are presented clearly, the philosophical termino- 
logy is not overpowering, and the arguments are likely to appeal to the experi- 
mental phenomenologist. 

Progress comes slowly. Impressive improvements in psychophysical techniques 
and apparatus technology can be discerned. Commendable work has been ac- 
complished in defining issues and constructing theories that can account for highly 
selected subsets of experimental findings. Important conflicts in the data base of 
perception have yet to be resolved. We now have bigger and more sophisticated 
controversies, but how many issues have been laid to rest? 

If I were to try to capture the essence of the area of perception in a word, then 
that word would be ferment. These are exciting times for perception. Nevertheless, 
the advantages that accrue to hindsight will be required to determine whether 
1972 and 1973 were truly vintage years.


HISTORICAL TRENDS


The last review of memory and verbal learning by Tulving & Madigan (180) 
painted a deeply pessimistic picture of the theoretical and experimental progress 
during the nearly 100 years of the post-Ebbinghaus era. As these critics saw it, 
tireless investigators compiled endless measurements of performance in labora- 
tory tasks which bear little or no relation to the complexities and subtleties of 
human memory. For the most part, the experiments demonstrated and quantified 
what Aristotle expounded in the remote days of antiquity—that memory is likely to 
reflect the contiguity and similarity of past events. While that proposition may
have some empirical validity, it offers no purchase on the dynamics of remem- 
bering. Yet the students of learning and memory were excruciatingly slow to break 
out of the constraints imposed by the pretheoretical schema of mechanical 
chaining of mental representations. A discouraging manifestation of the futility of 
it all was the fact that laboratory procedures, such as serial and paired-associate 
learning, which obviously can be no more than a means to an end, became an 
object of experimental and theoretical analysis, i.e. an end by themselves. Thus it 
came about that in the middle of the twentieth century, man still thought “about 
his own memory processes in terms readily translatable into ancient Greek” 
(180, p. 437). It is only fair to add that the picture painted by Tulving and Madigan 
was not one of unrelieved gloom. They saw many promising leads as attention 
shifted away from the traditional problems of association to questions about the 
processing, storage, and retrieval of the multiplex information to which man is 
exposed both inside and outside the laboratory. 

1 The preparation of this report was facilitated by grant MH 12006 from the National
Institute of Mental Health.

292 POSTMAN

This chapter is being written about 5 years after the last review, but the lines of
historical development appear quite different to the present writer, and not only
because of the advances and changes in the field that have occurred during the
intervening period. Perhaps the first point that is worth making is that the anti- 
quity of concepts is not a measure of their triviality in psychology any more than it 
is in mathematics. Contiguity and similarity have, in fact, turned out to be fun- 
damental determinants of remembering. We know a great deal now, but not 
nearly enough, about their mode of operation. Those who have made a deter- 
mined effort to break out of the traditional mold have nevertheless been con- 
strained to treat contiguity and similarity as fundamental variables, for example in 
the formulation of the principles of grouping in perception and memory, origin- 
ally proposed by the Gestaltists and now newly demonstrated by proponents of 
organization theory (20). It is essentially by virtue of contiguity and similarity that 
higher-order units in memory are assumed to be formed, although one may now 
prefer to describe the consequences as seriation (103) and subjective organization. 
The examples could be multiplied to show that there is an abundance of old wine 
in new bottles on the shelves of those who are deeply disappointed in the plodding 
accomplishments of the past. 

It is not the age and the tradition-encrusted aura of the concepts and variables 
that matter, but the use to which they are put. Consider, for example, frequency, 
which is no less hoary a principle than contiguity and similarity in the history of 
thought about memory. One of the most promising and productive approaches to 
the problem of recognition today is built on the elegantly simple assumption that 
the discrimination between old and new events is based on the attribute of 
frequency (186). Let us also note that the most recent, and probably most sophis- 
ticated and comprehensive, model of human information processing and memory 
is described by its authors as a neo-Aristotelian theory of association (3). This is a 
model which is written in the language of computers; thus the voice of Aristotle is 
heard again in this electronic age.

One cannot judge the fruits of past labors just by the advances that have been 
made in theory. It is necessary to consider the cumulative growth in empirical 
knowledge as well. Tulving and Madigan tell us that of the many hundreds of
papers they examined they were able to classify less than 10 percent as 
“worthwhile.” These were papers that clarified existing problems, “opening up 
new areas of investigation, and providing titillating glimpses into the unknown” 
(180, p. 442). As for the rest of the publications, they were consigned to the 
categories of “utterly inconsequential” or “run-of-the-mill.” We must concede 
without demurrer that the literature on verbal learning and memory contains 
more than its share of hack products reporting trivial parametric manipulations 
and unnecessary variations on old and tired themes. It is (perhaps deceptively) 
easy to do routine studies of verbal learning, and many are done because the 
variables are there to be manipulated. There would probably be fair agreement on 
what is “utterly inconsequential.” 

It is the evaluation of the “run-of-the-mill” work that is open to debate. There
have been few, if any, truly crucial experiments on memory which signaled a 
theoretical breakthrough. Unfortunately, unexpected and counterintuitive results 
more often than not ultimately are found to be artifacts or interpretable in terms of
already known principles. The theoretical blast of Rock’s thesis of one-trial
learning (148) ended on a whimper amidst confusion about uncontrollable variables
and uncertainties of measurement (147). A recent case in point is the
phenomenon of negative part-to-whole transfer (175), which we discuss below. In 
the current state of the psychology of verbal learning and memory, theoretical 
advances are most likely to occur through the integration of stable and repeatable 
experimental results and the resolution of apparent inconsistencies among ob- 
served empirical phenomena. The “run-of-the-mill” studies are useful in contri- 
buting stable facts and in alerting us to the lack of order in empirical relationships. 
Thus the products may be valuable, even though the context of discovery may now 
be judged by some to have been routine or pedestrian. 

A few examples will help to show how new and challenging theoretical issues 
are anchored to (and perhaps in some cases arise out of) both the orderliness and
the disorderliness of run-of-the-mill facts. One of the central issues in current 
discussions of the nature of memory is the validity of the distinction between a 
short-term and a long-term store. The resolution of this question has hinged in 
large measure on the reexamination of the effects of such old and familiar varia- 
bles as serial position, presentation rate, and the similarity between the memory 
task and the activities filling the retention interval (e.g. 54, 87). Theorists knew 
what the expected effects in conventional rote-learning situations were; by show- 
ing that these relations failed to materialize in certain tests of short-term retention, 
they were trying to make a case for a separate and different memory system. 

Consider next the conditions and characteristics of encoding which today oc- 
cupy the center of the theoretical stage—encoding variability (106, 115), encoding 
persistence (61), and encoding specificity (182), dimensions of encoding or depth of 
processing (36), and so forth. In each case the arguments pro and con take their 
point of departure from phenomena of old standing; new experimental arrange- 
ments are devised to permit these classes of phenomena to exhibit the operation of 
the postulated intervening mechanisms. Investigations of the effects of spacing on 
recall and reexaminations of the conventional paradigms of paired-associate 
transfer have been the primary testing grounds of the principles of encoding 
variability and persistence. Contextual stimuli at input and output have been 
manipulated to demonstrate encoding specificity, and the time-honored question 
of differences between recall and recognition has emerged as a critical dependent 
variable in the evaluation of this hypothesis. Those concerned with levels and 
varieties of processing have brought us back to an examination of the conse- 
quences of sheer repetition and the variations in incidental memory attendant 
upon the performance of different orienting tasks. We will comment on these 
theoretical developments later. For the present, these examples are intended to 
serve as a reminder of the symbiotic relationship between patient fact-finders and 
innovative theorists. What kinds of evidence are “worthwhile” and “run-of-the- 
mill” changes continuously, and there is a high risk in applying these categorical 
judgments prematurely. 

The degree to which one sees the field of verbal learning and memory as static or 
changing depends on his temporal perspective. In a broad sweep over the “period
from approximately 350 B.C. to 1969 A.D.” (180, p. 437), Tulving and Madigan saw
painfully little change. When one focuses on the brief span of the last two decades 
or so, recent changes in concepts, theories, and methods appear drastic and loom 
large. During these years, the methodological constraints of behaviorism and the 
frugal tenets of functionalism were eagerly set aside by many influential investi- 
gators. Concepts that had lain dormant because they had appeared to lack com- 
plete scientific respectability made a spectacular comeback—organization, im- 
agery, attention, consciousness, to name just a few. The language of the computer 
replaced the vocabulary of habit theory and conditioning: the basic sequence of 
events in learning and remembering is described in terms of coding, processing, 
storage, and retrieval. Model building became a major preoccupation. The 
number of models—boxes joined by intricate arrangements of arrows and loops, 
flow diagrams, computer programs, systems of equations, and multifarious com- 
binations thereof—proliferated at a remarkable rate. Today the ratio of models to 
experiments is quite high, although happily still less than unity. As part of the 
modelers’ armamentarium, the proverbial homunculus has gained a measure of 
implicit tolerance, sometimes in the guise of an executive routine, selecting codes, 
deciding on processing strategies, and searching through the recesses of the memory
store(s). None of these pretheoretical and theoretical orientations is intrinsically
good or bad. There is no question but that investigations formulated within
an information-processing framework have yielded important facts about learn- 
ing and memory. The critical issue which has to be faced is whether the multipli- 
city and complexity of hypothetical mechanisms are outdistancing the observable 
facts at a rate that is cause for alarm. This question is the underlying theme of the 
selective review of the recent literature that follows. 

One more point about the current intellectual climate may be in order. Against 
the background of the functionalist tradition, the developments which we have 
described had the earmarks of a theoretical and methodological revolution. Now it 
is a commonplace observation that revolutionaries tend to turn conservative once 
they seize the reins of government. It is to be feared that intellectual revolution- 
aries are no exception. There are worrisome signs that a new orthodoxy and a 
normative nominalism may be in the making. Consider, for example, the follow- 
ing remarkable statement which appeared in the discussion section of a paper in 
which the authors argued for an interpretation of incidental learning in terms of 
type of processing as opposed to differential responding to the stimulus items: 


Of course, we do not understand processes better than responses nor do we argue that 
processes are more objective or more scientific. Indeed, we probably know as little 
about processes as we do about hypothetical responses. We do argue, however, that the 
notion of processes involved in a task offers more adequate, productive, and interesting 
experimental ideas than the relatively constricted (and equally hypothetical) notion of 
responses (77, p. 479).


Surely the name that one attaches to an hypothetical event should not matter in 
and of itself when the nature of the event is only vaguely understood. We must 
guard against the obvious fallacy that a change to a preferred language entails, or 
even holds out the promise of, better understanding.

THE SHORT AND HAPPY REIGN OF DUAL-PROCESS THEORY


Few theoretical ideas have influenced our field of research as profoundly as the 
formulation of the multiple-store conception of memory. While there have been 
various versions differing in detail (cf. 130), the modal scheme encompasses three 
stages: a sensory register, a short-term store and a long-term store (e.g. 4). Since 
our concern here is with verbal learning and memory, we will not consider the 
sensory component any further but will address ourselves only to the dichotomy 
between a short-term store (STS) and a long-term store (LTS). The question to be 
examined is how tenable this distinction appears to be in the light of the most 
recent experimental evidence. 

Before the present status of the theory is appraised, it is well to remind ourselves 
of the type of evidence that motivated and sustained the formulation of a dual-store
hypothesis. The finding that massive forgetting of a subspan amount of material
can occur in a matter of seconds, as in the Brown-Peterson paradigm, was un- 
doubtedly influential but would not have been sufficient in itself. There is nothing 
in the fact that forgetting is much more rapid under some experimental arrange- 
ments than others which compels the inference of qualitatively distinct processes. 
The conviction that there are separate stores was founded on indications that 
short-term and long-term traces had different properties, and that the establish- 
ment of the latter was dependent on the survival of the former. As is well known, 
the major lines of evidence which have been invoked to support a functional 
dichotomy of memory processes include (a) the specificity of memory deficits in 
brain-damaged patients; (b) apparent differences in encoding related to the length 
of the interval between input and retention test; (c) the effects of presentation 
modality on performance; and (d) the interaction of the serial position at input 
with a variety of experimental manipulations, with recall of the terminal items 
presumably reflecting output from STS and that of early and middle items from 
LTS. 

While models vary with respect to the characteristics assigned to the two stores, 
there is substantial agreement on the major functional differences between the 
hypothesized long-term and short-term systems. A useful composite summary of 
these differences was recently presented by Craik & Lockhart (36): STS is a system 
of limited capacity in which information is maintained by continued attention and 
rehearsal. The format of the information is predominantly phonemic, probably 
also visual, and “possibly semantic.” The trace is of short duration (up to 30 sec) 
and loss from the store occurs through displacement or “possibly decay.” Retrieval 
is likely to be automatic or dependent on temporal and phonemic cues. Informa- 
tion that has been maintained by rehearsal in STS is transferred to LTS; a 
common assumption is that the probability of transfer is a direct function of the 
duration of an item’s stay in STS (4, 54, 131). Once information has entered LTS, 
which has no determinate limits, it is maintained by repetition or organization. 
The format of the information is largely semantic but includes some visual and 
auditory representations as well. The long-term trace is of indefinite duration. 
Successful retrieval depends on the availability of appropriate cues and may be
the outcome of a search process. Forgetting occurs when the stored information
becomes inaccessible because of the loss of retrieval cues or because discriminability
is degraded through interference.

The title of the present section was chosen to reflect an emerging consensus that 
this compact and well articulated schema is rapidly losing both face validity and 
empirical support. The change in the climate of opinion is made all the more 
impressive by the fact that some of the most vigorous and persuasive criticisms 
have been advanced by investigators who in the past have contributed substan- 
tially to the supporting evidence (121, 167, 201). In discussing the experimental 
evidence, we will refer to the salient features of the two hypothetical stores listed 
by Craik & Lockhart (36). Since there have been several comprehensive reviews of 
the literature during the last few years, we will cite just a few of the many relevant 
recent studies.


Coding in STS 


Note that Craik and Lockhart described the format of information in STS as 
“phonemic, probably visual, possibly semantic.” This guarded and tentative de- 
scription appropriately reflects the increasing weight of the evidence against the 
proposition that the coding of material in STS is necessarily or exclusively 
acoustic. The origin of this somewhat counter-intuitive hypothesis can be traced in 
the first instance to the identification of STS as a holding mechanism in which 
information is maintained through rehearsal. In order to be rehearsed, visual 
input has to be recoded into an auditory form (173). 

It is less clear why it was further assumed by at least some theorists (6, 7, 90, 125) 
that semantic coding does not occur in STS but only in LTS. Such an assumption 
may have reflected in part a search for characteristics which would differentiate 
sharply between the two systems. It can also be seen as related once more to the 
rehearsal function of STS. Semantic processing of the items reduces the amount of 
time and effort that can be devoted to maintenance rehearsal. Atkinson & Shiffrin 
(4) pointed out explicitly that a trade-off might occur in STS between maintenance 
rehearsal on the one hand and elaborations and transformations of the input on 
the other. They suggested that the extent to which such a division of effort did in 
fact occur in any given situation was under the control of the learner. 

Other exponents of dual-process theory, notably Glanzer (54), put the mne- 
monic organization of the material squarely in STS. The probability that a mne- 
monic relation can be established and used is seen by Glanzer as dependent on 
how closely the relevant items follow each other, i.e. how likely they are to be 
together in STS. Norman & Rumelhart (131) suggest that the primary distinction 
between long-term and short-term memory lies in the degree of permanence with 
which contextual information is attached to stored representations. “When information
is retrieved, all that matters is the number of attributes that are still
retained: no distinction is made between short- and long-term storage” (p. 29). In
retrospect, one cannot help but wonder why the proposition that semantic en- 
coding does not occur in STS attained the prominence that it did. Support for this 
particular hypothesis has surfaced only occasionally (e.g. 90, 98). Nevertheless,
a significant proportion of the papers on short-term memory during the last few
years was directed to the disproof of the hypothesis. There appears to be a 
functional autonomy of hypotheses as well as of methods (180). 

It is necessary then to distinguish between two quite separate questions 
subsumed under the hypothesis that coding in STS is primarily phonemic. First, 
can information entering in a nonverbal format be maintained only if it is recoded 
phonemically? Second, can information be coded semantically in STS and what 
are the conditions determining whether it is or not? The evidence most commonly 
cited for phonemic recoding of visual inputs is (a) the impairment in the recall of
visually presented letters as the acoustic similarity of the letters is increased, and (b)
the correspondence between the pattern of errors in recall and the confusions that
occur in tests of auditory perception of the letters (30). It is doubtful that any 
general conclusions about the coding limitations of STS should have been drawn 
from such results. The only valid inference is that phonemic coding does occur in 
STS, not that it is the exclusive form of coding (cf. 62). Visual coding could have
occurred without being reflected in confusion errors, perhaps because the visual 
and phonemic traces decay at different rates (201). In any event, it has now become 
apparent that the mechanisms underlying phonemic confusion errors are more 
complex than had been originally supposed and that there is no simple and direct 
line of argument from the occurrence of such errors to fixed coding characteristics 
of STS. 

Two important recent studies bearing on this question should be mentioned 
here. Peterson & Johnson (138) returned to the question raised earlier (66, 123) of 
the relationship between articulation and acoustic confusion effects in immediate 
recall. To minimize articulation, subjects were required to count aloud during the 
visual presentation of letter strings. The input sequences were of either low or high 
acoustic confusability. Control subjects remained silent during the presentation of 
the strings. The usual inverse relation between acoustic similarity and recall was 
found in the silent condition, but this effect was entirely eliminated when counting 
was required. In a second experiment, the effects of silence and counting were 
compared under conditions of auditory presentation. As in the previous situation, 
performance was substantially reduced by counting but the acoustic similarity 
effect was now present under both conditions. The conclusion was that both 
acoustic and articulatory cues can function in short-term retention. “The subject 
will use what is convenient” (p. 350). That is, he adds articulatory cues when 
presentation is visual and utilizes acoustic ones when presentation is auditory. 

In the experiments of Estes (47, 48), the dependence of phonemic encoding on 
vocalization is clearly demonstrated. As expected, confusions among acoustically 
similar letters presented visually rose to a sharp peak and then declined as a func- 
tion of retention interval when vocalization was permitted at input. However, 
when vocalization was prevented by the concurrent performance of a verbal task 
(categorizing the individual letters according to their position in the alphabet) or 
by reduced exposure duration, the probability of confusion errors was no greater 
for acoustically similar than dissimilar letters (at chance). Furthermore, there was 
evidence for phonemic encoding when vocalization was suppressed during input
but a brief pause for rehearsal was introduced within less than 2 sec after the visual
display. It appears, therefore, that nonauditory representations of the letters re- 
main available for rehearsal for a second or more even though other visual 
characters have been displayed in the same location. Estes concludes that the 
phonemic encoding of visually presented items is not effected by a central recod- 
ing process; rather it is based on an auditory-articulatory input which occurs in 
parallel with or subsequent to the visual presentation. It is important to note that 
in both the studies we have summarized retention is well above chance even when 
vocalization is prevented and the use of acoustic cues is minimized. It is true that 
vocalization enhances item availability, but some of the information is maintained 
without vocalization, although it remains to be determined how. 

The question of how the trace of the visual input is maintained for a delayed 
phonemic readout is of considerable theoretical importance. As Estes points out, it 
would not be reasonable to explain this finding in terms of the maintenance of 
visual features in a short-term visual memory system. Under the fast rate of 
presentation used, the image of a given letter would have been overwritten during 
the period of delay by subsequent items displayed in the same location. Existing 
models obviously do not provide an exhaustive account of the ways in which 
information can be maintained for purposes of further processing immediately 
after input. The idea that an auditory-verbal-linguistic format is essential for 
sustained short-term storage has more generally been called into serious question 
by the results of a series of recent studies comparing retention for visual and 
auditory inputs. 

In the experiments by Kroll, Parkinson, and their associates (91, 93, 94, 135, 136, 
159) the subjects are engaged in shadowing aurally presented letters while trying 
to remember a single “memory letter” that is presented either visually or aurally. 
The rationale of this manipulation is that the amount of interference produced by 
the interpolated shadowing task should be equivalent if the visual display were 
recoded into an auditory format. The results consistently fail to confirm this 
prediction. Immediate retention of the auditory and visual letter is equivalent or 
may favor the former; however, after a shadowing interval of the order of 10-20 
sec, retention of the visual memory letter is superior. Phonemic similarity between 
the target letter and the interpolated shadowing material reduces recall when the 
presentation is aural but not when it is visual. This finding reinforces the conclu- 
sion that the visual input is not recoded into an auditory format in this situation. 
Furthermore, immediate and delayed serial position curves are similar in the 
recall of visually presented stimuli, showing both primacy and recency effects. 
With auditory presentation, only a primacy effect is typically obtained after a 
delay. 

It is interesting to note that superiority of visually presented trigrams was 
recently observed in the standard Brown-Peterson paradigm, with backward 
counting as the distractor activity (160). This result is reasonable when the simi- 
larity (with respect to the modality engaged) between the original and the inter- 
polated task is considered. The apparent persistence of visual information over the 
short term, although not to a greater extent than of auditory inputs, has been
demonstrated under other experimental arrangements as well (e.g. 96). Taken
together, these studies make it clear that coding in an acoustic format is not 
mandatory for short-term retention, even though that format may be dominant in 
many situations. It would be unfortunate if recognition of this state of affairs led to 
a subdivision of the storage compartment into autonomous auditory and visual 
boxes. Some investigators are tempted to speculate along these lines. A more
reasonable conclusion would seem to be that if one has to think in terms of a 
short-term store, one should conceive of it in functional rather than structural 
terms (138). 

A brief general comment on modality differences may be interjected here.
Except in special situations like those used by Kroll et al, performance is typically 
better after auditory than visual presentation. The difference is usually confined to 
the terminal items in a list (e.g. 32, 101, 122, 197). When visual and auditory 
presentations are mixed within a list, recall tends to be organized by modality (122, 
129). The effects are impressive, but it remains rather unclear what they imply. 
A variety of explanations have been suggested (cf. 121, 197). One hypothesis which
has received considerable attention is that the input to STS may be augmented
from a precategorical acoustic store (39, 120). However, such an interpretation 
appears to be untenable since decay from this store is too rapid to encompass the 
time span over which the modality differences have been shown to extend (121). 
To postulate separate visual and auditory prelinguistic stores which differ in 
capacity (122) is essentially to restate the explanatory problem in structural terms. 

Recent experiments have shown that the advantage of auditory over visual 
presentation is independent of word length in both free and serial recall (197, 198). 
Since the unit of information in the precategorical store must be taken to be less 
than a word, the results indicate that the modality effect probably reflects a 
postcategorical stage of processing. Watkins (197) suggests that recency and 
modality effects, which are both independent of word length, are determined by 
the same process in primary memory, but the exact nature of this process remains 
to be specified. In any event, as Murdock (121) points out, the modality effect 
presents a problem for dual-store theories which assume that items pass from STS 
to LTS since the superiority of the auditory mode is usually limited to the recency 
portion of the serial position curve. 

We turn now to evidence bearing on the question of whether or not semantic 
encoding occurs in STS. As already noted, the hypothesis that it does not was 
rather implausible to begin with, nor has it been entertained in a serious way by 
many dual-store theorists. It is sufficient, therefore, to cite the recent careful 
experiments by Shulman (166, 168) which leave little doubt that the semantic 
characteristics of words can significantly influence performance in a short-term 
memory situation. Shulman used a probe recognition procedure in which the 
subject was required to judge whether a test item was identical to one of the words 
in a 10-word list, a synonym of one of the words, or a homonym. The subject did 
not know in advance which type of probe would occur and thus was presumably 
forced to encode the items semantically as well as phonemically. The functions 
relating correct recognition to serial position were closely similar in shape and in
each case showed a pronounced recency effect. While the level of recognition for
synonyms was high, it fell below that for homonyms. The implications of such a 
difference are equivocal, however, since the degree of semantic similarity cannot 
necessarily be assumed to be as great as that of phonemic similarity. In a subsi- 
diary experiment Shulman found that subjects’ ability to recognize synonymity is 
less than perfect even when memory is not involved. It may be noted in passing 
that comparing the effects of two types of similarity is always extremely risky, 
precisely because of the difficulty of equating the degree of relationship along the 
different dimensions.

Shulman’s results show that semantic encoding in a short-term memory situa- 
tion can be highly effective when the task demands it. The author proposes a 
principle of time-dependent encoding: it takes longer to process an item seman- 
tically than to process it phonemically. Hence subjects may confine themselves to 
the latter dimension when that is sufficient to meet the experimental requirements. 
This interpretation receives some support from the fact that recognition of syn- 
onyms improves as rate of presentation decreases, whereas that of identical and 
homonymous targets does not. There remained the unlikely possibility that words 
are always stored phonemically but are recoded into a semantic form when 
synonym recognition is required. This hypothesis was tested in an ingenious 
experiment by Raser (144) and not unexpectedly found wanting. Hopefully, the 
question of whether words can be encoded semantically in short-term memory 
situations is now settled. The idea that people would mouth the sounds of words 
for several seconds at a stretch without knowing what they were saying was always 
a bit unsettling. 


Rehearsal and Transfer to LTS 


The typical flow chart of a dual-process model shows some, but not all, of the 
information that has entered STS as being transferred to LTS. Exactly how this 
transfer might occur is usually not specified. We see an arrow pointing from one 
box to another, but few if any theorists would have wanted us to take the box 
schema literally, perhaps signifying that excitation flows from a short-term to a 
long-term location. In any event, it seemed reasonable to suppose that the proba- 
bility of transfer was related directly to the length of an item’s stay in STS. While 
an item remains in STS, it can be maintained through rehearsal, the corresponding 
LTS trace is built up, and there is time for useful coding operations and other 
storage processes to be accomplished (4, 54). Apparent strong support for this 
conception was obtained in experiments on free recall when the frequency with 
which individual items were rehearsed during study trials was correlated with the 
probability of their being reproduced on a subsequent test of retention (154, 156, 
157). This correlation was very high (thus accounting for the effects of primacy), 
with the notable exception of the last few positions in the list. For the terminal 
portion, the frequency of rehearsals was very low but the probability of recall was 
high, particularly in the last position. The last few items may be retrievable from 
STS, and to that extent the probability of recall should be independent of the 
number of rehearsals. That is, an item is available and accessible if it is still in STS,
regardless of how often it has been rehearsed. The earlier items, however, must be
retrieved from LTS; hence the probability of their recall is determined by the
amount of information in LTS, which in turn is a function of the number of 
rehearsals. 

This line of reasoning was further strengthened by the demonstration of a 
negative recency effect in delayed free recall (33, 35). When subjects learn a series 
of lists, each presented for a single study trial, a pronounced recency effect is 
obtained on the immediate test of recall for each individual list. However, when 
recall of all the lists is requested at the end of the session, the items that had 
occupied the terminal positions in the successive lists are remembered least well. 
It had, of course, been known that the recency effect in single-trial free recall is 
eliminated by a few seconds of interpolated activity (54); the additional finding 
now was that in final free recall for a series of lists retention for the terminal items 
fell below that for the middle ones (hence the phrase, negative recency). The 
negative recency effect has considerable generality: it has been found with serial as 
well as free recall (27), with probe recall of paired associates (102, 110), and holds 
both for items that were recalled on the immediate test and those that were not 
(although initial recall raises the level of final retention). It has been observed for 
recognition as well (35, 41), although not invariably so (27). The obvious explana- 
tion within the framework of a two-store model is that terminal items, while 
highly accessible from STS on an immediate test, are inadequately registered in 
LTS, in fact more inadequately than any other items, and hence cannot be 
retrieved from LTS on the final test. 
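
The logic of this two-store account can be made concrete with a minimal sketch.
It is an illustration only, not any published model. It assumes a fixed-capacity
rehearsal buffer from which the oldest item is displaced, one unit of rehearsal
per presentation shared equally among the items in the buffer, and an exponential
relation between accumulated LTS strength and retrieval from LTS. The function
name and the parameter values (buffer_size, theta) are hypothetical.

```python
import math


def simulate_two_store(list_length=12, buffer_size=4, theta=1.0):
    """Toy two-store account: returns (lts_strength, immediate, delayed) per serial position."""
    buffer = []                          # items currently held in STS
    lts_strength = [0.0] * list_length   # accumulated LTS trace per item

    for item in range(list_length):
        if len(buffer) == buffer_size:
            buffer.pop(0)                # assumed rule: the oldest item is displaced
        buffer.append(item)
        share = 1.0 / len(buffer)        # one unit of rehearsal, split equally
        for held in buffer:
            lts_strength[held] += share

    # Assumed link between LTS strength and probability of retrieval from LTS.
    from_lts = [1.0 - math.exp(-theta * s) for s in lts_strength]

    # Immediate test: items still in the buffer are read out directly from STS.
    immediate = [1.0 if i in buffer else p for i, p in enumerate(from_lts)]
    # Final (delayed) test: STS has been emptied, so everything depends on LTS.
    delayed = from_lts
    return lts_strength, immediate, delayed


if __name__ == "__main__":
    strength, immediate, delayed = simulate_two_store()
    for pos, (s, p_now, p_later) in enumerate(zip(strength, immediate, delayed), start=1):
        print(f"position {pos:2d}  LTS strength {s:.2f}  immediate {p_now:.2f}  delayed {p_later:.2f}")
```

Under these assumed values the terminal items are recalled best on the immediate
test and worst on the final test, while the initial items retain an advantage on
both. This is the pattern the two-store account was invoked to explain; the sketch
says nothing about whether that account is correct.
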

So far so good. Unfortunately, subsequent work has shown that in spite of its 
face validity the dual-process interpretation of the negative recency effect was 
apparently wrong, at least insofar as inadequate registration in LTS was attributed 
to shortness of stay in STS and lack of opportunity for rehearsal. Several related 
experiments have now converged on the conclusion that long-term retention is not 
determined by either the sheer amount of rehearsal or the duration of an item’s 
stay in STS. Rather what is critical is the type of rehearsal or processing that takes 
place during the input period. Rehearsal that serves only to maintain the imme- 
diate availability of an item, such as simple repetition, does little if anything to 
enhance subsequent recall. Active processing of the item—elaboration, transfor- 
mation, recoding, and so on—does enhance subsequent recall. For purposes of 
further discussion we will adopt the terminology used in a recent article and 
distinguish between maintenance rehearsal on the one hand and constructive 
rehearsal on the other (213). Some of the major findings which appear to lend 
support to this distinction will be summarized briefly: 

1. When a distractor task is performed after the presentation of each of a series 
of lists, recall of the terminal items is depressed as compared to conditions in 
which intervals of either silent or overt rehearsal periods are interpolated. How- 
ever, on the final test of recall for all lists, retention of these items is highest under 
the distractor treatment. Similar results are obtained with lists of subspan length 
(78, 79). The proposed interpretation is that in anticipation of the rehearsal-pre- 
venting activity the critical items are processed more thoroughly, i.e. rehearsal 
becomes more constructive. Since no differences are found on a final test of
recognition, the generation of effective retrieval cues is implicated. An examina- 
tion of the results suggests that such an interpretation should be made with 
considerable caution. While the differences in final performance are significant, 
they are by no means of impressive proportions. The subjects’ ability to improve 
their efficiency of processing must have been rather limited. Perhaps more im- 
portant, with reference to immediate recall, a comparison is effectively made 
between the final retention of lists varying in length. For various reasons, losses 
may be greater for a relatively long than short list. Such pedestrian variables have 
a way of intruding themselves and should not be overlooked. 

2. Craik & Watkins (37) undertook to manipulate directly the length of STS 
storage and the amount of overt rehearsal for items to be recalled subsequently, 
presumably from LTS. Neither storage time nor the number of overt rehearsals 
was directly related to long-term recall. One of their experiments bears directly on 
the interpretation of the negative recency effect. Subjects were induced to rehearse 
aloud the terminal items in successive lists as often as the initial ones. If the poor 
delayed recall of terminal items is due to a lack of rehearsal, the retention deficit 
should be eliminated under these conditions. It was not. It appears that the 
additional rehearsal was not constructive enough. One is left with the question, 
however, of whether massed rehearsals of terminal items should be expected to be 
as effective as the typically distributed rehearsals (156) of the initial items. 

3. In the context of a study of directed forgetting, Woodward, Bjork & 
Jongeward (213) investigated the relation between amount of rehearsal and 
subsequent retention. For present purposes, the critical manipulation was that the 
cue to remember or forget was presented for individual words after varying 
periods of delay. The subjects had to hold each word in memory until the cue 
appeared. The length of the rehearsal period had no effect on the initial recall of 
the to-be-remembered items nor on the final recall of either the to-be-remembered 
or the to-be-forgotten items [similar results had been reported previously (43, 
212)]. There was, however, a strong relationship between rehearsal time and final 
recognition for both classes of items. The basic conclusion is again that there was 
no constructive rehearsal of the type required for recall. However, rote mainte- 
nance rehearsal apparently will strengthen an item’s representation sufficiently to 
improve long-term recognition. What is not known is how much of each of the 
delay periods was actually spent in rehearsal. If the range in the number of 
rehearsals was limited, there might have been sufficient variation to affect recog- 
nition but not recall. 

4. To round out the picture, it should be mentioned that a lack of relation 
between rehearsal frequency and end-of-session recall has also been found in the 
traditional Brown-Peterson situation (117). The retention intervals were either 
filled with the usual backward-counting task or left free for rehearsal. On the tests 
for individual trigrams the experimental manipulation produced the expected 
differences—there was heavy forgetting under the distractor condition and nearly 
perfect recall over all retention intervals when rehearsal was allowed. However,
the final recall scores were virtually identical. According to the authors,
unrestricted rehearsal served to maintain the items in STS but not to encode them in
LTS. Unfortunately, it is difficult to draw any conclusions from these results since 
the probability of final recall was only .10. A list of 25 CCCs of low association value 
is just extremely difficult to recall. The problem of low sensitivity produced by 
floor effects arises, to varying degrees, in the assessment of most of the final recall 
scores that have been considered in this section. 

We have discussed these studies in some detail not only because of their 
critical importance for dual-process theory but also because they call into serious 
question a widely held and rarely questioned assumption that there is a direct 
relation between retention and amount of rehearsal or study time. The proposition 
that rehearsal activities can vary in effectiveness should cause little surprise. What 
is rather startling is the flat assertion that sheer repetition or “maintenance re- 
hearsal” has only momentary effects and fails to influence recall even a few 
minutes later. Not long ago, Cooper & Pantle (31) came to a very different
conclusion on the basis of their review of the evidence concerning the total-time 
hypothesis: “When task requirements do not exceed simple rehearsal ... a fixed 
amount of time is necessary to learn a fixed amount of material” (p. 232). The 
apparent conflict with the total-time principle has been noted by others (37). That 
principle has, of course, limited validity, but there certainly have been no indica- 
tions of a zero correlation between study time and performance. One might try to 
reconcile the divergent conclusions by claiming that rehearsal was adequately 
constructive whenever a positive relation to study time was found. Possibly so, but 
there is also the finding that sheer “mindless” repetition—pronouncing items aloud 
over and over—has very substantial effects on the level of incidental learning (113, 
114). Performance of orienting tasks which are alleged to involve no “semantic 
processing” does result in respectable amounts of incidental learning, even though 
not as much as observed under conditions believed to induce such processing (77). 

For the time being, it will be prudent to view the proposed dichotomy between 
maintenance rehearsal and constructive rehearsal with great caution. Dichotomies
have a way of becoming points on a continuum. Null effects such as those
observed in the last three studies described above do not provide a firm basis for 
broad generalizations, especially when there are reasonable questions about the 
degree of control over the variables and the sensitivity of the measures of retention. 
It may be that a classification of rehearsal activities with respect to degree of 
“constructiveness” will prove very useful, but if that is to happen, one has to be 
mindful of the danger of circular reasoning in the use of vague and unspecified 
qualitative distinctions. The experiment by Jacoby & Bartz (79) showing the 
beneficial effect of an interpolated distractor activity on subsequent recall is a 
case in point. The inference that subjects anticipating an interfering task can 
proceed at a moment’s notice to generate more effective retrieval cues (whatever 
these may be) is entirely ad hoc. There is nothing in the experimental manipula- 
tions related to retrieval cues; to argue that the quality or usefulness of the cues 
was improved does no more than to restate the apparent effect of the independent 
variable. We return below to the difficulties that arise in attempts to establish 
taxonomies of processing. 


The Limited Capacity of STS and Short-Term Forgetting


Short-term forgetting is closely linked to the limited capacity of STS in dual-pro- 
cess theories. As Atkinson & Shiffrin (4) see it, information decays from the store 
unless it is maintained by rehearsal. But the number of units that can be so 
maintained is strictly limited. Each new item entered into the rehearsal buffer 
displaces one already there. One can of course postulate displacement as the basic 
mechanism of loss and discount decay (200). Within this general framework two 
major uncertainties concern the capacity of the store and the relative importance 
of decay and displacement. 
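
A minimal sketch may help fix the displacement idea. The Python fragment below models a store of fixed capacity in which each new item pushes out one already held. The oldest-first rule, the capacity of four, and the function name `present` are assumptions made for illustration only; the account above does not specify which item is displaced, and decay is left out entirely.

```python
from collections import deque


def present(items, capacity):
    """Feed items one at a time into a fixed-capacity buffer.

    Returns (contents, displaced): what remains in the buffer and what
    was pushed out, in order of displacement.
    """
    if capacity < 1:
        raise ValueError("capacity must be at least 1")
    buffer = deque()
    displaced = []
    for item in items:
        if len(buffer) == capacity:
            # Assumption: the oldest item is the one displaced.
            displaced.append(buffer.popleft())
        buffer.append(item)
    return list(buffer), displaced


# Hypothetical six-item list presented to a four-slot buffer.
contents, displaced = present(["A", "B", "C", "D", "E", "F"], capacity=4)
```

Under these assumptions only the last four items survive, which is the sense in which capacity and displacement jointly determine short-term loss in such a model.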

As Craik & Lockhart (36) point out, the concept of limited capacity has ambig- 
uous connotations since it is often unclear whether the limitation is meant to apply 
to the number of units that can be stored or the rate at which successive units can 
be processed. They suggest that existing numerical estimates bear more on 
storage than processing limitations. In any event, the available estimates vary with 
the definition of functional units and the method of measurement (often deviating 
drastically from the magic number seven plus or minus two). The interpretative 
difficulties that beset the assessment of STS storage capacity may be illustrated 
with reference to the work of Glanzer and his associates (summarized in 54). The 
point of departure in this series of studies is the assumption that the serial position 
curve in free recall reflects the output from two different stores: for all positions 
except the terminal ones retrieval is primarily or entirely from LTS; terminal items 
may be retrieved from either LTS or STS but on an immediate test of recall may be 
expected to come primarily from the latter. This basic assumption is taken to be
validated by the interaction of task and presentation variables with the serial 
position of the items at input. Factors such as rate of presentation, list length, word 
frequency, and interitem associative relations have systematic effects on recall at 
all but the terminal serial positions. 

The raw interactions obviously do not imply anything about output from sep- 
arate stores. Given the purely descriptive phenomenon of a recency peak, noth- 
ing more may be involved than a ceiling effect. Some estimate of the amount of 
information retrieved from each of the stores is needed to interpret the interac- 
tions. To accomplish this objective, Glanzer, like numerous other investigators, 
has applied a formula devised by Waugh & Norman (200). The asymptote of the 
serial position curve or the level of recall after a delay furnishes a measure of LTS. 
The output of STS is then calculated by means of a subtraction procedure. The 
worrisome feature of this approach is that the logic of measurement comes close to 
being circular. The existence of two separate stores is assumed, not independently 
demonstrated, and the choice of measures of LTS essentially begs the question or 
at the very least makes an arbitrary assumption about the sources of output at 
different positions or after different retention intervals. Solving a single equation 
with two unknowns is necessarily a bootstrap operation, and measures flowing 
from such a solution have to be viewed with considerable reservations. 
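
The subtraction procedure can be made concrete. In the form usually attributed to Waugh & Norman (200), observed recall R at a serial position is treated as the output of two independent sources, R = STS + LTS - (STS x LTS), which rearranges to STS = (R - LTS) / (1 - LTS). The Python sketch below applies that rearrangement to invented recall probabilities. The function name, the numbers, and the choice of asymptote are assumptions for illustration, not data from any study cited here.

```python
def sts_estimate(recall, lts):
    """Solve R = STS + LTS - STS*LTS for STS, given R and an LTS estimate."""
    if not 0.0 <= lts < 1.0:
        raise ValueError("LTS estimate must lie in [0, 1)")
    return (recall - lts) / (1.0 - lts)


# Hypothetical recall probabilities for the last six serial positions.
recall_last_six = [0.30, 0.35, 0.45, 0.60, 0.80, 0.95]

# Hypothetical asymptote of the serial position curve, taken as the LTS measure.
lts_asymptote = 0.30

per_position = [sts_estimate(r, lts_asymptote) for r in recall_last_six]

# Summing the per-position estimates gives a derived "number of units in STS".
capacity = sum(per_position)

# A position where recall falls below the assumed asymptote.
below_asymptote = sts_estimate(0.25, lts_asymptote)
```

With these invented values the summed estimate comes to a little over two units, and a recall probability below the assumed asymptote yields a negative estimate. Both outcomes follow from the single measured quantity R and the chosen value of LTS; the second unknown is supplied by assumption.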

These difficulties should be borne in mind when we consider Glanzer’s major 
conclusions regarding the capacity of STS, which hinge critically on the validity of
the subtraction measures. He found the capacity of the store limited to “about 
three items.” Such a low estimate was bound to strain credulity, if for no other 
reason than because “it is incompatible with what we know about understanding 
grammatical utterances” (121, p. 91). Recently Glanzer & Razel (56) attempted to 
meet this criticism by showing that the capacity limitations (which they revised 
downward from about three to two) apply to units that may vary in length rather 
than to fixed elements such as phonemes, morphemes, or words. They found that 
the estimated number of units in STS remained constant at about two, regardless 
of whether the units were monosyllabic words, disyllabic words, compound words, 
cliché sentences, or unfamiliar sentences. It should be understood that the near 
invariance in the number of units represents the derived estimates, specifically the 
sums of the estimates for the last six serial positions. The actual data often show 
substantial differences in the level of recall except for the ubiquitous recency peak. 
The authors’ conclusion is that the number of units held in STS is essentially fixed 
but that the size and complexity of the units can vary freely and may be as long as 
a complete sentence. Thus, the limited-capacity store can function effectively in 
linguistic processing.

The argument is not convincing. The phenomenon of chunking is hardly new. 
The recency peak is also here to stay, but whether it largely reflects the output of a 
distinct and separate system is something to be proved rather than postulated. 
What is most troublesome is the reconstruction of the fine grain of the memory 
system by means of a formula of highly dubious validity. The Waugh-Norman 
equation does yield numbers that are systematically related to the changing 
probabilities of recall as a function of serial position. It seems overly sanguine to 
reify these numbers into “real” units in a hypothetical store. One might well be 
pulled up short in pursuing this tack when he finds that this formula yields zero, 
and even negative, estimates of STS output for individual subjects. However that 
may be, this exercise still leaves unaltered the astounding claim that during the 
acquisition of a list of unrelated words the STS can hold only two such units. 

Is the apparent evanescence of the contents of STS attributable to decay, 
displacement, or both? This once highly controversial question has not been 
debated much of late, undoubtedly because of the apparent difficulty of arriving at 
an experimental decision between the alternatives. Recently an ingenious new 
procedure developed by Reitman (145) has served to reopen the question. The 
innovation was to fill the interval between item presentation and test with an 
exacting tone detection task that would preclude verbal interference and at the 
same time minimize if not eliminate rehearsal. Thus one might come reasonably 
close to assessing the effects of the sheer passage of time on short-term retention. 
There was no forgetting over a 15-sec period under this arrangement. Similar re- 
sults were obtained by Shiffrin (165) in an extension of Reitman’s experiment. The 
implication that a strictly nonverbal distractor task does not result in the loss of 
verbal items from STS has not unexpectedly been called into question. The 
objection was raised that Reitman’s signal-detection task was not sufficiently 
demanding to prevent some recycling of the material held in memory. With a 
tone-shadowing task considered to be more demanding, progressive declines in 
retention were observed; furthermore, interpolated pursuit-rotor activity led to a
reduction of the terminal peak in free recall (199). The investigators concluded 
that two different factors may contribute to the loss of items from STS: (a) a 
diversion of attention from the to-be-remembered items which may be effected by 
nonverbal distractors, and (b) the similarity between the items in store and the 
interpolated materials which produces difficulty in retrieval. Thus the standoff 
between decay (either in its pristine sense or in the guise of diversion of attention) 
and interference will have to continue for a while longer. 

In general, there has been little systematic progress in the exploration of the 
mechanisms responsible for forgetting from STS (whatever its capacity). Glanzer 
has remained a staunch defender of an essentially pure displacement principle. In 
his free-recall experiments, the similarity between interpolated and list items 
appears to have no effect (55). To draw a strong conclusion that the similarity 
between the target items and the interpolated activity is irrelevant obviously flies 
in the face of a great deal of evidence (which need not be enumerated here) that 
both phonemic and semantic similarity significantly influence the amount of 
retroactive interference in short-term memory situations. Glanzer (54) suggests 
that such results may reflect effects on LTS rather than STS. The same argument 
has been made about proactive interference in short-term memory, with the aid of 
the Waugh-Norman subtraction formula (34). Whether one accepts this argument 
depends to a considerable extent on his faith in the precision of the measurement 
of STS by subtraction formulas. 

As in the work on long-term retention of lists, the understanding of the condi- 
tions and mechanisms of proactive inhibition (PI) in short-term memory appears 
to be decreasing rather than increasing. Earlier there was good reason to believe 
that such principles as unlearning and spontaneous recovery were directly appli-
cable to the explanation of proactive losses in short-term memory, and the
Brown-Peterson situation in particular (86). It has now been shown that there may 
be little or no unlearning of successive items under the very conditions that lead to 
a progressive buildup of PI. The most direct demonstration of this fact comes from 
a study by Dillon (44), whose subjects were required to recall their previous 
responses at various stages of a critical trial (either before the presentation of the 
new item or at one of several points during the retention interval). There was no 
evidence for either unlearning or spontaneous recovery. Absence of unlearning 
can also be inferred from the substantial proactive facilitation that is found when 
items are repeated after a lag of several trials (26). Repetition enhances recall more 
than does release from PI effected by a shift in taxonomic category (63). 

The effectiveness of competition at recall as a determinant of short-term PI 
has also been questioned. One line of evidence is that recall does not increase 
when the subject has the preceding items before him at the time of test (25, 44). 
This fact does not, however, constitute as strong an argument against competition 
as might appear at first blush. The actual presence of the prior items may enhance 
their competitive potential. The situation is reminiscent of the negative effects of 
list cues in free recall (155). However, these results do raise questions about the 
importance of the role played by failures of item differentiation. If not unlearning 
and competition, then what? There are several other possibilities. One recurrent
suggestion is retrieval failure. For example, in experiments on release from PI, the 
effectiveness of item-specific retrieval cues may decline as members of the same 
class follow each other; when there is a shift in class, the power of the most recent 
retrieval cue is restored. Gardiner, Craik & Birtwistle (52) have presented evi- 
dence that appears to favor this interpretation. When the shift is from one subset to 
another within a broad category, release from PI occurs only when the name of the 
new subset is presented either at input or at the time of recall. Retrieval difficulties 
are implicated because of the effectiveness of a subset cue that is first introduced in 
the test phase. 

It is not clear to what extent such findings can be generalized to situations in 
which successive items do not belong to a circumscribed category. There is not 
enough evidence to attribute the buildup of PI for all types of materials to the 
progressive weakening of retrieval cues. The explanation would soon falter or 
become ad hoc. When successive items do not fall into normative groupings, we 
usually do not know what the retrieval cues might be, or whether it is even useful 
to think in terms of changes in the effectiveness of such cues. The possible role of 
retrieval cues in the buildup of, and release from, PI must also be examined in the 
light of several demonstrations that tests of recognition show changes in perfor- 
mance over trials quite similar to those observed with recall (25, 59, 60, 64, 139). If 
one assumes that retrieval does not play a critical part in recognition, then the 
implication is that PI reflects deficiencies of storage or perhaps forgetting during 
storage in the sense of Posner’s (141) “acid bath” hypothesis. If the proposition 
that recognition is relatively retrieval-free is rejected (181), the inference obviously 
does not follow. 

The available recognition data have failed to yield conclusive information 
about the role of item differentiation in PI. Previous items have been found to be 
more powerful distractors than new ones in some cases (60, 64) but not in others 
(59, 139). The total picture remains blurred. A search for a single locus or single 
mechanism of proaction is not likely to prove any more successful for short-term 
retention than it has in research on long-term retention of lists. Analyses in terms 
of conventional principles of interference have begun to yield diminishing returns. 
Among alternative hypotheses the possibility mentioned by Dillon (44) that PI 
may reflect the progressive degradation of the quality of learning deserves especial 
attention. The same possibility has been brought to the fore in studies of PI in the 
long-term retention of paired-associate lists. The principles of PI may in the end 
turn out to be the same in short-term and long-term retention but are likely to 
include mechanisms not now encompassed by theories of interference. 


The Moral of the Story of Dual-Process Theory 


The moral one may well draw from the history of dual-process theory is that there 
are important risks in the rush to modeling which has of late been so characteristic 
of the field of memory. In retrospect, what happened is that certain observed phe-
nomena of memory, e.g. the forgetting of a single trigram in a matter of seconds,
looked strikingly different from those typically recorded under conventional 
experimental arrangements. The difference in appearances was so great that there
was a strong disposition to look for other evidence implying the operation of 
qualitatively different processes in spite of the obvious objections to postulating 
discontinuities of function.* The computer-oriented nomenclature of information
processing was there, waiting to be used, and the processes were soon translated 
into structures. Protestations to the contrary notwithstanding, the structures be- 
came reified. Counting the exact number of units or slots in STS signifies nothing 
less. 

Now a structural model with bins connected by arrows is potentially as useful as 
any other theoretical framework, i.e. it should be judged by its heuristic value. Fair 
enough. In practice, however, structural models appear to entail risks that are 
endemic to them: premature precision, internal fission, and proliferation. Once an 
STS was proposed, its capacity, the characteristics of the units held in it and 
displaced from it, and its temporal parameters had to be specified. As we have 
suggested, the measurement procedures have tended to be circular and to involve 
arbitrary assumptions. Once the estimates are made, however, the validity of the 
formulas is rarely questioned, and the numbers are taken literally; that is prema- 
ture precision. As for internal fission, it occurs predictably when new or unex- 
pected findings have to be accommodated. It is easy enough to add another box 
and pair of arrows, for example an intermediate-term as well as short-term and 
long-term memory, or to subdivide the store into compartments for different 
modalities. The data promoting internal fission are often important, but patching 
up a flow chart does not really aid in understanding and explaining them. Finally,
there is proliferation. Diagrams representing the successive stages of information
processing are easy to construct, and there are no apparent rules delimiting the
kinds of structures that one can incorporate into them. So the number of general
models inevitably grows apace, to say nothing of specialized models for such 
processes as recall and recognition. For a sampling see pages 21, 118, 205, 241, 347, 
377, and 382 in Models of Human Memory (130). There would be more if some of 
the models had not been described verbally. Variety may be the spice of life and 
heterodoxy an essential condition of scientific progress, but modeling license does 
contribute to theoretical chaos. The time may have come to unsheath Occam’s 
razor once more: Models should not be multiplied beyond necessity. 

If this picture is somewhat overdrawn, it was done deliberately to call 
attention to the dangers of the theoretical particularism which has been rife in the 
study of learning and memory. We have traced the source of this trend to the 
formulation of two-store models. The retreat from such models began as their 
assumptions were subjected to increasingly searching scrutiny. The implications of 
these developments are well worth pondering. 


* We have not considered the results obtained with brain-damaged patients which con- 
tinue to be cited as evidence for dual-process theory (201). The existing data do not impress 
us as unequivocal; more important, extrapolations from pathological deficits to the structure 
of normal memory are of uncertain validity. We must also note a temptation to reverse the 
argument and to identify STS and LTS on the basis of comparisons between normal subjects 
and patients (8, p. 181). 


LEVELS OF PROCESSING AND DIMENSIONS OF ENCODING


Levels of Processing 


In what promises to be a very influential paper, Craik & Lockhart (36) suggested 
that the broad range of phenomena which had motivated the postulation of mul- 
tiple stores can be conceptualized more adequately in terms of levels of processing. 
The guiding assumption is that the presentation of any stimulus initiates a series or 
hierarchy of processing stages which can be graded along a continuum of depth. 
Sensory analysis and semantic-cognitive elaboration define the end points of this 
continuum. Subsequent retention is a function of depth. “Thus, memory, too, is 
viewed as a continuum from the transient products of sensory analyses to the 
highly durable products of semantic-associative operations” (p. 676). Regardless 
of depth, information can be maintained in memory by being recirculated at a 
currently activated level of processing. Recirculating means keeping the items, as 
analyzed up to a given point, in consciousness and is essentially synonymous with 
the maintenance rehearsal of dual-process theory. The recycling operation is 
designated as Type I processing and defines primary memory. By contrast, Type II
processing occurs when the analysis proceeds to a new and deeper level. At any 
one level, Type I processing will not improve retention beyond that characteristic 
of that level, only Type II processing will. The depth to which analysis proceeds 
and the amount of material that can be encompassed are constrained by the 
limited capacity of a central processor which can be deployed flexibly “to one of 
several levels in one of several encoding dimensions” (p. 679). The limited-capa- 
city processor expends its energy economically and initiates maintenance rehear- 
sal at a relatively shallow level, e.g. the acoustic-articulatory one, when that is 
sufficient to satisfy the task demands. The length of the sequences that can be 
maintained depends on their compatibility with existing cognitive structures: the 
greater the consistency, the more readily the components can be grouped or 
chunked for purposes of processing. 

One apparent advantage of this approach is that observed differences between 
short-term and long-term memory can be referred to the flexible mode of opera- 
tion of the information-processing system. There are no longer any fixed expec- 
tations about either the number or the characteristics of units that should be 
retained under the experimental arrangements used to define short-term and 
long-term memory. However, the way in which the flexible central processor is 
believed to advance to successively deeper levels of analysis, and the conditions 
which are assumed to be conducive to maintenance rehearsal, are fully consistent 
with the functional differences previously attributed to STS and LTS. Findings 
which have created fundamental difficulties for dual-process interpretations, such 
as the lack of correlation between the sheer amount of rehearsal and subsequent 
retention, can be readily referred to the failure of stimulus analysis to have 
reached a sufficient level of depth. And the critical serial position effects in 
immediate and delayed recall can be handled easily by assuming that depth of 
encoding varies with the position of an item in the list. 

Craik and Lockhart view their approach not as a full-fledged theory (for
example, problems of retrieval are omitted from consideration at this point) but
rather as a conceptual framework for research on memory. It should be considered
in this spirit. The analysis is timely and significant to the extent that it makes a 
convincing case for an interpretation of the apparent differences between short- 
term and long-term memory without recourse to a structural two-store model. The 
argument makes it clear that the existing facts do not compel us to accept as
axiomatic a dichotomy of nonsensory memories with structural properties. It is 
less clear, however, that the fundamental issue of continuity versus discontinuity 
of memory functions has been resolved satisfactorily. Does the discontinuity not 
surface once more in the guise of stages of processing? The levels of processing 
evolve in stages, but the authors assure us that they envisage these levels as a 
continuum of analysis. But if there is a true continuum, then the identification of 
separate stages is either impossible or arbitrary. When we see the scheme applied 
to specific phenomena, we come away with the impression that the successive 
levels follow each other in a stop-and-go fashion. Recycling can be initiated at any 
of the postsensory stages. Thus there are in effect qualitative discontinuities in 
memory; for example what is retained may be primarily acoustic rather than 
semantic information. The final level that will be reflected in retention is not, 
however, correlated in a one-to-one fashion with processing time; the authors 
recognize that analysis does not necessarily proceed in an orderly sequence from 
superficial to deep. Why then insist on time-dependent stages? The basic conclu- 
sion that emerges from the analysis is that information can be processed in many 
different ways, and what is remembered depends on the kind of processing that 
has occurred. That sounds like a truism, but it was probably an important truism to 
restate at this point. The arrangement of levels of encoding in temporal lockstep 
appears to be a continuing concession to the claim that there are systematic 
changes in mode of encoding as one moves from short-term to long-term memory. 
Yet, given the ambiguous implications of confusion errors, there is no hard 
evidence for this claim. It has yet to be shown that under normal circumstances an 
individual hearing a word will process and recycle only the sound without know- 
ing what the word means. What we are questioning here is not the assumption that 
processing takes time, and that the more diversified the processing the more time is 
required; it is rather the more or less orderly progression from surface to depth 
that is at issue. 

The normative concept of depth is likely to prove troublesome. It is taken for 
granted that the semantic-associative elaborations represent the deeper levels of 
analysis. That assumption has face validity when one thinks of associative or free 
recall. But there are situations in which the processing of other features, particu- 
larly of frequency and contextual information, becomes critical. Much of the 
current work on recognition points to this conclusion. Must the definition of depth 
then change with task requirements? Is “optimal processing” different from pro- 
cessing to maximal depth? It should also not be taken as self-evident that trace 
strength and durability increase directly with depth of analysis. It is true that there 
is some evidence that acoustic features are forgotten more rapidly than semantic 
ones (e.g. 5, 24), but the differences are not very large and it is not clear whether 
they are a function of trace strength or variations in interpolated interference. It 
had once seemed intuitively obvious that there is an inverse relation between 
meaningfulness and rate of forgetting, but this expectation has not been borne out 
empirically. 

If a conceptualization in terms of levels of processing is to provide a productive 
framework for research on memory, we must ask how likely it is to generate 
experimentally testable hypotheses. The very flexibility which is attributed to the 
processing system carries with it the danger of circularity. The specific danger is 
that one would be tempted to infer what the level of processing was from the 
results of retention tests. But such evidence is obviously post hoc. What is needed is 
to bring the level of encoding under experimental control and to observe the 
effects on retention. Craik and Lockhart point out that the incidental learning 
paradigm permits one to do just that. The experimenter can determine how the 
subject processes the material by his choice of orienting task. They cite numerous 
studies which show that amount of incidental memory is directly related to the 
depth of processing required by the orienting task. This line of evidence bears 
closer scrutiny. 


TYPE OF PROCESSING AND INCIDENTAL MEMORY In an extensive series of experi- 
ments, Jenkins and his associates (75-77, 82, 178, 196) have marshalled evidence 
for the proposition that, regardless of instructions to learn, the amount of retention 
is determined by whether or not the subject engages in semantic processing. Other 
investigators have likewise concluded that incidental memory is a function of the 
degree of semantic processing involved in the orienting task (45, 80, 118). For 
purposes of examining some of the interpretative problems that remain to be re- 
solved, we confine ourselves to a consideration of the studies of Jenkins’ group. In 
these experiments, the typical pattern of results is that incidental subjects per- 
forming a semantic task recall as much as those who perform the same task and 
also expect to be tested for recall; furthermore, there is no difference between 
these two groups and a control group that is simply instructed to learn the words 
and does not perform an orienting task. When the orienting task is nonsemantic, 
retention is considerably lower, and equally so for subjects who do, and subjects 
who do not, expect to be tested for recall. The conclusion is that the only critical 
determinant of recall is the type of processing, and this holds true regardless of the 
difficulty of the orienting task. 

This conclusion is very strong, but there are some rather troublesome features of 
both the experimental manipulations and the results which call for caution. The 
first concerns the a priori differentiation between semantic and nonsemantic tasks. 
The semantic task used most often was rating the stimulus words as to their 
pleasantness or unpleasantness; in other cases the frequency of usage of the words 
was rated, or a word that could be appropriately paired with the target item was 
generated. A variety of nonsemantic orienting tasks was devised to induce the 
subject to “treat the word as an object without considering its semantic charac- 
teristics (or meaning)” (82, p. 92). The activities falling under this heading in- 
cluded checking for the presence of certain letters in the words, estimating the 
number of letters, and producing rhymes. Also included in this category were 
syntactic tasks, e.g. determining the part of speech and deciding whether a given 
word did or did not fit into a particular sentence frame. How valid is the classifi- 
cation of the tasks? Few would be disposed to deny that checking for the presence 
of certain letters need not involve processing the meaning of a word, but credulity 
is strained when the same assertion is made about identification of parts of speech 
and judgments of the appropriateness of a word for a sentence frame. On the other 
hand, why is judging the frequency of a word a deeply semantic activity? One does 
not have to think very much about the meaning of a word to decide whether or not 
it is a familiar “object.” The point of these carping comments is that “semantic 
processing” is a concept lacking in precision and invoked on the strength of the 
investigator’s intuition which may not be shared by others. The same applies to the 
delimitation of situations in which such processing presumably does not occur. 

In any event, there is every reason to worry about the implications of the results 
because the experimental manipulations work, as it were, too well. Specifically, 
there are two warning signs. First, subjects forewarned about the test of reten- 
tion do no better than purely incidental learners, even when the orienting tasks are 
nonsemantic. That is an unusual result which is at variance with most of the 
available evidence (112). Surely the instructed subjects should have engaged in at 
least a modicum of semantic processing. Second, the subject’s ability to grasp and 
store the semantic characteristics of a word is apparently so great that it is not 
affected by normal limitations on processing capacity. That is implied by the 
finding that retention is not significantly lowered when the subject must perform a 
nonsemantic as well as a semantic task. (Actually, the difference is always in the 
expected direction and in some cases sizeable.) 

The results may not be nearly as decisive as the investigators supposed. A 
question that invites careful examination is whether there are systematic differ- 
ences between the supposedly semantic and nonsemantic tasks. One potentially 
important difference is that the semantic tasks call for subjective ratings which 
cannot be either right or wrong. Most of the nonsemantic tasks, on the other hand, 
require the subject to make a response the correctness of which can be evaluated 
objectively. The subjects can monitor their own performance and are likely to do 
so. To put it somewhat differently, the nonsemantic tasks involve a determinate 
amount of information reduction whereas the semantic ones do not. Now it is 
known that in short-term memory situations retention declines in an orderly 
fashion as the amount of information to be reduced by performance of the 
interpolated task increases (141). The assertion that it is the degree of semantic 
processing which constitutes the only, or the critical, difference between the two 
classes of orienting tasks remains, therefore, to be validated. If the processing 
demands of the orienting task are sufficiently great, the absence of differences 
between instructed and uninstructed subjects would become understandable. As 
for the failure of concurrent tasks to reduce the effectiveness of semantic process- 
ing, reconciliation with other evidence that leads to the opposite conclusion is 
clearly needed. Quite recently, Mandler & Worden (104) presented evidence 
which they interpreted as demonstrating that the performance of a subsidiary task 
does significantly impair subsequent recall and recognition. (It is interesting to 
note that these authors classified the identification of words as nouns or verbs as a 
semantic processing task!) The actual data obtained by Jenkins and his associates 
also bear careful reexamination. In one relevant study, six out of six compar- 
isons showed higher recall when a semantic task was performed by itself rather 
than in conjunction with a nonsemantic task (196, experiments I and III). More 
evidence will be needed before it becomes reasonable to conclude that semantic 
activity is immune to the constraints of limited processing capacity. 


The Multiple Dimensions of Word Encoding 


Few students of memory today would be willing to assert in the manner of 
Gertrude Stein that a word is a word is a word. On the contrary, they are becoming 
increasingly impressed with the multitude of features and attributes which the 
learner can perceive and encode when a word is presented to him. What remains 
uncertain is which and how many of such characteristics are processed and en- 
coded on the occasion of a brief presentation during a learning experiment. As we 
have seen, Craik and Lockhart would have us think in terms of a sequence of suc- 
cessive encoding steps, with the sequence subject to termination at any convenient 
stage. A very different picture is painted by Wickens (202) on the basis of his 
programmatic series of experiments in which release from PI was effected by shifts 
in item characteristics along many diverse dimensions. He concludes that in the 
“split second” while a word is processed by an individual “it is granted a locus on 
many . . . dimensions or aspects—encoded, in short, in a multiplicity of ways” (202, 
p. 12). These dimensions are both semantic and nonsemantic, and Wickens views 
their activation as essentially simultaneous and automatic. As the test of release 
from PI continues to be applied, the number of encoding dimensions that appear 
to be activated during the presentation of a word keeps growing. Some recent 
additions include sense impressions (205), syntactic features (65), language of 
presentation to bilingual subjects (58), and phonological features (29). There are 
also some puzzles such as the asymmetry of modality shifts which is observed 
under some conditions but not others (71-73, 92). 

The conclusion that all the dimensions for which release from PI has been 
observed are simultaneously encoded during the presentation of a word can be 
questioned. As Underwood (187) has pointed out, there are potentially serious 
objections to such an inference. First, it is doubtful that all the attributes that have 
been investigated are independent. However, even if some of them are tied 
together, an impressively large number of independently encoded characteristics 
would be likely to remain. A more basic question is raised by Underwood’s 
suggestion that the experimental arrangements of the release experiment carry an 
instruction to the subject to encode successive items along a particular dimension. 
That is, the critical dimension is primed; for example, treating the items as 
members of a specific taxonomic class becomes more and more likely as one 
instance after another is presented. To the extent that priming occurs, the number 
of attributes that are encoded automatically and simultaneously may be greatly 
overestimated. One cannot, therefore, add together all the features that have been 
shown to be effective in separate release experiments and thus reconstruct the total 
ensemble of encodings that are activated simultaneously. It is altogether reason- 
able to suppose that more than one feature of a word is registered, and there is 
some evidence that the release effect is greater when there is a shift with respect to 
two characteristics rather than one (203). What the limits of multiple encoding 
might be under normal circumstances remains uncertain. 


Encoding of Phonological Attributes 


The role of phonological attributes in the acquisition of lists of words has begun to 
receive systematic attention only fairly recently. There may have been a disposi- 
tion to take it for granted that such attributes are of little significance in long-term 
memory, in accordance with the assumption of two-store theories. There is now 
abundant evidence that such an assumption would have been incorrect. Perhaps 
the best documentation is contained in a programmatic series of studies by Nelson 
and his associates (summarized in 126) in which the ordinal position effects of 
letter duplication were carefully examined. The essential finding is that the mag- 
nitude of such effects is greatest in the initial, second greatest in the terminal, and 
least in the medial, positions of stimulus words. The direction of the effects 
depends on task requirements. In paired-associate learning, where stimulus dis- 
crimination is essential, difficulty of acquisition conforms to this pattern. On the 
other hand, when recall is free, the ranking of conditions is reversed; the same is 
true when the position of elements shared by stimulus and response terms is 
manipulated in paired-associate learning. Nelson suggests that words are pro- 
cessed as phonological sequences, essentially like miniature serial lists. The ease 
with which access is gained to this code varies with the serial position of the 
elements, with the initial position dominant. Response recall is facilitated by the 
presence of overlapping features, whereas the associative stage of learning is 
delayed. An interesting recent finding is that the typical ordinal position effect is 
present during the acquisition of pairs of concrete as well as abstract words, even 
when the former are learned under instruction to generate interacting images 
(127). The implication is that the imaginal attributes and phonological features are 
processed independently. There is little doubt that phonological similarity can 
play an important role even when familiar words are used as the learning mater- 
ials. Evidence obtained in studies of transfer (97, 204) and false recognition (e.g. 
38, 128, 158) fully supports this conclusion. 


Are Words Stored When Sentences Are Encoded? 


We have been discussing the encoding of individual words in the context of list 
learning. This may be the appropriate place to mention the hypothesis, which has 
gained some popularity, that when individuals are exposed to interrelated sen- 
tences, they may not encode and store individual strings of words at all but rather 
“wholistic semantic ideas” that represent the total propositional content of the 
sentences. This hypothesis was advanced by Bransford & Franks (23, 49). The 
individual sentences presented to their subjects contained from one to three 
atomic propositions which were drawn from a complex integrated statement that 
comprised four such components. On a subsequent test of recognition the 
confidence with which the subjects recognized a sentence was a direct function of the 
number of atomic propositions contained in it. As long as a test sentence was 
consistent with the total integrated idea, it did not make any difference whether 
that particular sentence had actually been presented or not. Thus the full four- 
proposition sentence received the highest confidence ratings even when it had not 
been presented. Test sentences which were inconsistent with the total idea were 
readily rejected. This pattern of results was taken to imply that unified semantic 
representations reflecting the total complex ideas were stored in memory; recog- 
nition then depends on the propositional overlap between the stored representa- 
tion and the test sentence. Subsequent work has served to cast doubt on the 
conclusion, at least in its strong form. The strength of the relation between 
recognition and the number of component propositions in a test sentence was 
shown to depend on instructions. This relation is attenuated when the recognition 
decision is based on identity of meaning rather than on the actual prior occurrence 
of a test sentence (84). Subjects are able to recognize component propositions to 
which they have been exposed, even though they may be uncertain whether they 
heard them alone or in conjunction with other components. Furthermore, as 
Anderson & Bower (3, pp. 347 ff.) have demonstrated, the difficulty of discrimi- 
nating between old and new sentences is predictable on the basis of the overlap in 
amount of contextual information attached to target and distractor sentences. The 
discrimination between old and new sentences is improved when a forced-choice 
recognition procedure is used. Finally, when simpler elements (letters and 
numbers) are used in a formal arrangement corresponding to that used in the 
semantic studies, it is found that subjects can abstract the rules but also retain 
specific items (146). It is clearly premature to conclude that specific verbal infor- 
mation is not stored when general linguistic ideas are abstracted from the input. 


THE VARIABILITY AND SPECIFICITY OF ENCODING 


We have used the concept of encoding freely without stopping to define it. That is 
characteristic of much contemporary writing on learning and memory. References 
to encoding are ubiquitous because it is taken as axiomatic that verbal inputs are 
somehow selectively processed or transformed before they are stored in memory. 
For purposes of the discussion that follows it is useful to examine the various 
connotations of encoding explicitly. A valuable classification of the types of 
encoding that are assumed to occur has been provided by Bower (21): (a) selec- 
tion—a component is selected out of a complex pattern; (b) rewriting—an input is 
translated into another format, e.g. a sequence of visual symbols is transformed 
into a verbal description; (c) componential description—a nominal item such as a 
word is registered as a list or complex of attributes or features; (d) elab- 
oration—additions are made to the nominal input in order to transform it and ren- 
der it more memorable, e.g. natural language mediators (142). 

The basic question of interest to the investigator of memory is how these 
different kinds of encodings influence acquisition and retention. Demonstrations 
that some type of encoding, such as the use of natural language mediators, is useful 
under some circumstances (119) do not carry us very far theoretically. (The 
evidence in support of this widely made claim is not very impressive.) The theo- 
retical implications assume more fundamental importance when relationships 
between systematic characteristics of encoding and subsequent retrieval and rec- 
ognition are considered. Two such characteristics are the variability and the 
specificity of encoding. 


Encoding Variability 


The principle of encoding variability was developed by Martin (106, 108) in the 
context of associative learning and applied by him principally to the analysis of 
problems of transfer and interference. The essential argument is that the encoding 
of a nominal stimulus (A) has some probability of varying from one occasion to 
the next. Since associations (A-B) are formed to stimuli-as-encoded, variability 
retards acquisition. As the encodings stabilize, so does acquisition performance. 
Encoding variability becomes a positive factor when a new response must be 
attached to an old stimulus (A-D). The guiding assumption here is that two 
different responses cannot be attached to the same stimulus-as-encoded. A failure 
to change encodings results in negative transfer and unlearning. When both B and 
D must be recalled, success in retrieval will depend on the probabilities of sam- 
pling the encodings specific to each response. 

The idea of encoding variability has also been applied in a quite different way, 
namely in contrasting the effects on the probability of recall of multiple as com- 
pared with single encodings of an item (115). Stated in its simplest terms, the 
proposition is that multiple encodings entail multiple retrieval cues which can 
provide access to the to-be-remembered item at the time of recall. Hence there 
should be a direct relation between the number of different encodings and the 
probability of recall of an item. One way in which encoding variability may be 
induced is by presenting an item at different times or in varied contexts. Thus this 
argument has become of major interest in the many current investigations of the 
differences between massed and spaced repetitions, particularly in free recall. 


THE SPACING EFFECT IN FREE RECALL It has been shown many times, and no 
longer needs to be referenced, that recall is higher when repetitions are spaced 
rather than massed. Furthermore, it has been found in some, but not all, studies 
that the beneficial effects of spacing increase as a function of the lag between 
successive repetitions. (Some of the possible reasons for the discrepant findings 
regarding the effects of lag are discussed in 115.) Several explanations of the 
spacing effect have been proposed (cf 17), but, as measured by the sheer number of 
studies, the encoding variability hypothesis appears to have engaged the interest of 
the majority of investigators. 

The explanation of the superiority of distributed practice (DP) over massed 
practice (MP) in free recall took its point of departure from the assumption that 
the formation of subjective units facilitates retrieval. Given this assumption, the 
hypothesis was formulated by Melton (115, p. 605) as follows: 
The prediction of the beneficial effects of DP, and the gradation of this beneficial effect 
as a function of lag, is obtained by assuming, first, that the coding of a word in two 
different subjective units (or in a larger subjective unit) increases the cues or the access 
routes to its retrieval, and second, that as the lag between two or more occurrences of a 
word increases, the word contexts in which it occurs become less and less correlated 
(more independent) and the total number of different cues to its retrieval increases. 


Note that the basic assumption that is made here questions the generality of 
another principle that underlies much of the analysis of organizational processes 
in free recall, namely, that successful retrieval depends on the development of 
consistent, rather than variable, higher-order units. What little evidence there is 
indicates that the incorporation of an item into more than one grouping during 
successive presentations does not in fact aid free recall and indeed may impair it 
(22, 211). However, the situation may be different under conditions of spaced 
presentation during a single study trial. 
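
The logic of Melton's two assumptions can be made concrete with a small numerical sketch. It is an illustration only and not part of Melton's formulation: the exponential decay of context overlap, the decay rate, and the number of cues per presentation are all hypothetical choices, and the function names are invented for the example. The sketch shows only that if the contexts of two presentations become less correlated as lag grows, the number of distinct retrieval cues rises from the massed value toward twice that value.

```python
import math


def context_overlap(lag: int, decay: float = 0.5) -> float:
    """Hypothetical overlap (0 to 1) between the contexts of two presentations.

    The exponential form and the decay rate are assumptions made for
    illustration; the source says only that contexts become less
    correlated as lag increases.
    """
    if lag < 0:
        raise ValueError("lag must be non-negative")
    return math.exp(-decay * lag)


def expected_distinct_cues(lag: int, cues_per_presentation: int = 3,
                           decay: float = 0.5) -> float:
    """Expected number of different retrieval cues after two presentations.

    With complete overlap (lag 0, massed repetition) the second presentation
    adds nothing; with no overlap the number of cues doubles.
    """
    overlap = context_overlap(lag, decay)
    return cues_per_presentation * (2.0 - overlap)


if __name__ == "__main__":
    for lag in (0, 1, 2, 4, 8):
        print(lag, round(expected_distinct_cues(lag), 3))
```

Under these assumptions the count of cues grows monotonically with lag, which is the graded benefit of spacing that the hypothesis predicts; whether recall actually follows such a function is the empirical question taken up in the studies reviewed below.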

The encoding-variability interpretation of the MP-DP effect has two related 
testable implications: (a) the lag effect should be eliminated if variability can be 
induced in the encoding of all items regardless of spacing of repetitions; (b) the 
effect should likewise be eliminated if variable encoding of repeated items is 
minimized. An early experiment by Madigan (100) was based on this logic. To 
control encoding, each item was accompanied by a cue during presentation. When 
an item was repeated, the cue either remained the same or was changed. In the 
former case, the encodings of all repeated items should have remained constant, in 
the latter case they should have varied. Unfortunately, a lag effect was observed in 
free recall under both treatments. It should be noted, however, that the conditions 
of encoding were manipulated by means of item-specific cues, whereas Melton’s 
original formulation had emphasized the influence of the verbal context created 
by preceding items. The majority of subsequent studies have been based on the 
same rationale as Madigan’s investigation. Cued as well as uncued recall has been 
used, but for present purposes we need be concerned only with the latter since 
results obtained with cued recall do not bear directly on the proposed explanation 
of the MP-DP differences in free recall. 

Numerous investigators have taken advantage of the existence of homographs 
to control encoding, manipulating the contexts in which the homographs were 
repeated (sentences or preceding items) so as to induce either the same or a 
different interpretation of these ambiguous words. The results are so inconsistent 
as to be utterly perplexing, and there would be little point in a detailed review. 
Suffice it to say that there is support for a variety of incompatible conclusions: (a) 
controlled encoding neither influences the level of recall nor does it attenuate the 
lag effect (83); (b) controlled encoding does produce substantial differences in 
recall and does attenuate the lag effect (53); (c) there is a lag effect when encoding 
is forced to remain the same, but not when it is caused to vary (111). This pattern 
has also been observed on a test of recognition (208). So much for homographs. 

In other studies, nonhomographs were used and the sentence context varied, 
bringing to light additional problems; for example, differential encoding is said to 
eliminate the lag effect only when the subject’s rehearsal pattern is strictly 
controlled (40). With other materials, the effectiveness of a second encoding has been 
described as being contingent on the forgetting of the first (183), but so far we have 
not been able to figure out how the subject ends up having the benefit of two 
retrieval routes under such circumstances. 

Another approach to the explanation of the MP-DP effect promises to yield 
more palpable results. The effect might not reflect an advantage of DP but rather a 
deficit under MP (185). Perhaps the subject spends less time processing a repeated 
item under massed than under distributed conditions. Direct confirmation of this 
hypothesis was obtained when subjects were allowed to pace themselves during 
the presentation of a long list containing both massed and distributed repetitions 
(164). However, internal analyses showed that the differences in recall could not 
be attributed entirely to the variations in exposure time. Some of the difference 
that remains unaccounted for may reflect differential rehearsal after the second 
repetition. The opportunity for rehearsals between presentations is directly 
related to lag; furthermore, the number of rehearsals after the second presentation 
has been shown to increase as a function of lag (154). Such bread-and-butter types 
of explanations may yet carry us well forward in understanding the MP-DP effect. 

Before we conclude this section on encoding variability, we return briefly to the 
evidence obtained in studies of transfer. The results and conclusions are not 
appreciably more clear-cut than with reference to the MP-DP effect. The idea that 
transfer should be profoundly influenced by the degree to which there are changes 
in the encodings of nominal stimuli is eminently reasonable. In particular, nega- 
tive transfer under paradigms such as A-B, A-D and A-B, A-Br should be capable 
of being reduced or eliminated if the common A term is appropriately recoded 
during the acquisition of the second task. Yet substantial interference under these 
conditions is the rule. One reason for this state of affairs seems to be that subjects 
are not disposed to recode stimuli but rather prefer to retain their old encodings. 
When compound stimuli with distinctive components are used, a shift in func- 
tional cue sometimes occurs under conditions of negative transfer (116) but often 
it does not (57, 161). The evidence is mixed as to whether a shift is necessarily 
beneficial (140, 206). Subjects may be conservative because in the long run they do 
better, or at least as well, working through associative interference with a familiar 
cue than trying to develop an entirely new encoding. The discouraging results 
obtained with compound stimuli, where the possibilities of recoding are obvious, 
do not tell us much about what happens when the nominal stimuli are integrated 
units such as words. Here there is little or no hard evidence for spontaneous 
recoding in the transfer phase. If recoding occurs but is apt to be so subtle as to 
escape detection by available methods (107), the matter has to be held in abeyance 
for the time being. 

Is encoding variability a basic phenomenon of learning and memory? No one 
knows for certain. The alleged phenomenon is a highly subjective one, and the 
evidence has, to say the least, been elusive. Yet we expect no diminution in the 
number of ingenious and complex experiments designed to capture the elusive 
variability on the wing. A future reviewer might well say, “They sought it with 
thimbles, they sought it with care; they pursued it with forks and hope...” 


Encoding Specificity 


We introduce this section with a well-known quotation from an article by Thom- 
son & Tulving: “No cue, however strongly associated with the TBR [to- 
be-remembered] item or otherwise related to it, can be effective unless the TBR 
item is specifically encoded with respect to that cue at the time of storage” (177, p. 
255). Before we consider the evidence bearing on this hypothesis, it is important to 
make clear under what conditions an item would be considered to have been 
specifically encoded with respect to a particular cue. This is not to be taken to 
mean that the learner habitually stores a potential retrieval cue with each item, 
internally generating paired associates, as it were. The assertion is rather that a 
retrieval cue will be effective only when the stored information includes the 
relation between the target word and the cue. Thus chair will be an effective 
retrieval cue for table only if the information about the table-chair relationship has 
been stored at the time of input (182, p. 359). This information will not necessarily 
be part of the trace of table because retrieval is from the episodic rather than the 
semantic store (179). The table-chair relationship is a permanent part of semantic 
memory but may not be part of the trace in the episodic system laid down by a 
particular presentation of table. 

The principle of encoding specificity, formulated within the framework of the 
distinction between semantic and episodic memory, is put squarely in opposition 
to the “generation-recognition models.” According to the latter, a retrieval cue 
initiates the search for, or generation of, potential responses. A response that is 
recognized, on the basis of contextual or occurrence information, is then given 
overtly. This position has been espoused by Bahrick (9-11), who found that the 
effectiveness of a retrieval cue first introduced at the time of recall could be 
expressed as the joint probability of successful generation and correct recognition 
of the target item. Thus recall is conceptualized as a two-stage process, involving 
both retrieval and recognition (2, 88). Tulving & Thomson (182) question two 
basic assumptions of the two-stage position: (a) that words have “transsituational 
identity,” i.e. unique representations in memory; and (b) that the presentation of a 
word results in immediate access to its location in memory. Neither of these 
assumptions is acceptable if the encoding of words is context-specific and storage 
is in an episodic system. 
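
The joint-probability statement of the two-stage position can be written out as a short worked example. This is a minimal sketch under the simplest reading of the text, namely that the effectiveness of a cue is the product of the probability that the cue leads to generation of the target and the probability that a generated target is correctly recognized. The function name and the numerical values are hypothetical and are not taken from Bahrick's data.

```python
def cued_recall_probability(p_generate: float, p_recognize: float) -> float:
    """Two-stage (generation-recognition) estimate of cued recall.

    p_generate:  probability that the cue leads the subject to generate
                 the target item as a candidate response.
    p_recognize: probability that the generated target is correctly
                 recognized as a member of the input list.
    """
    for p in (p_generate, p_recognize):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
    return p_generate * p_recognize


if __name__ == "__main__":
    # Hypothetical values: a cue that elicits the target 60% of the time,
    # with generated targets recognized half of the time.
    print(cued_recall_probability(0.6, 0.5))
```

On this account cued recall can never exceed recognition of the generated items, since the product of two probabilities is no larger than either factor. That constraint is what the encoding-specificity experiments described next call into question.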

There are three important experiments by Tulving and Thomson which, singly 
and in combination, have been interpreted as supporting the principle of encoding 
specificity. These studies are well known and widely cited, and a brief recapitula- 
tion of the major findings should be sufficient here. 1. When TBR items are 
presented in the context of weakly associated cues, recall cued by these weak cues 
is considerably higher than uncued recall. By contrast, recall cued by strong 
normative associates is no better than uncued recall (177). 2. On tests of recogni- 
tion, either additions to or subtractions from the context present at the time of 
recognition impair performance. Context here refers to the presence of associates 
of the target item (181). 3. Items recalled in a context different from that at input 
frequently fail to be recognized. Words are again presented in the presence of 
weak cues. Many of the target items are reproduced when subjects subsequently
generate free associates to strong normative cues, but in a high proportion of the 
cases the target items fail to be recognized as members of the input list. In this 
situation cued recall is higher than recognition (182). 

The theoretical issues raised by these results are fundamental indeed. It should 
not be surprising that they can be seen as open to different interpretations. In 
considering the ineffectiveness of strong cues after input in the presence of weak
ones, we have to ask whether learning a pairwise contingency—in the presence of 
A respond with B—ipso facto entails the encoding of B with respect to A, in the 
sense that the semantic features of the target item (B) become assimilated to those 
of the cue (A). Suppose that does not happen to any significant extent. A strong 
normative associate (C) presented at the time of test may not facilitate, or may 
indeed hinder, recall simply by virtue of the fact that it is a new cue. It is known 
that failure to recognize a stimulus may lead to failure of recall. Furthermore, to 
the extent that the subject makes use of backward associations (B-A) to check on the 
correctness of potential responses (cf 209), the presence of the new cue may lead 
him to the erroneous rejection of target items. There is an obvious parallel here to 
the A-B, C-B paradigm of transfer where associative interference is observed with 
responses of high meaningfulness. 

Similar questions arise with respect to the finding that many target items are not 
recognized when they are generated by the subjects under free-association in- 
structions. Again there is a new cue, namely, the stimulus used to elicit the free 
associates. Thus erroneous rejection of target items becomes likely, for the reasons 
already mentioned. A complicating factor is that each target item had to be picked 
out from among four or six free associates given to the same test stimulus. The 
semantic similarity between generated targets and generated distractors was, 
therefore, probably quite high and the recognition task correspondingly difficult. 
Under these circumstances, one would expect subjects to adopt a stringent 
criterion in making their recognition decisions. There is no conclusive evidence on 
this point, but it is to be noted that in the study of Tulving and Thomson the raw 
hit rate was more than doubled when a forced-choice test of recognition was used. 
The implications of this study are sufficiently important to warrant a detailed 
consideration of all the factors that might be responsible for the reversal of the 
usual relation between recognition and recall. Whatever the outcome of further 
analyses may be, the fact that such a reversal can occur under some circumstances 
is of considerable theoretical interest. 

The major interpretative problem stems from the operations used to effect 
changes in encoding between presentation and test. The thrust of the theoretical 
argument is that the encoding of the target item per se is profoundly influenced
by the cognitive environment in which it appears. In the studies under discussion, 
the weak cues accompanying the target items at input constitute the cognitive 
environment. However, the experimental operations are essentially those of 
paired-associate learning and, especially after a series of set-inducing lists, em- 
phasize pairwise contingencies. To the extent that encoding specificity lies in the 
information about these contingencies, the broad theoretical implications of the 
results would be changed. 

We turn now to the effects of contextual changes on recognition. The original
findings have been confirmed and extended by Thomson (176). Although there are 
exceptions, either the addition or the deletion of context reduces recognition 
performance. The presence or absence of a second item is viewed as constituting a 
cognitive environment which has a critical influence on the encoding of the target 
item. Would it be equally reasonable to say that the mode of presentation of an 
item—whether it appeared alone or in juxtaposition with another word—is one of 
the attributes of the input event that is stored? It is known that subjects remember 
other “extraneous” features of input events reasonably well, such as serial position 
and modality of presentation. When a recognition decision is difficult, the subject 
may rely on whatever information remains available about the physical conditions 
of the original presentation. There is some apparent support for this interpretation 
in Thomson’s data: he found that the inclusion of both single items and pairs in the 
input list was a necessary condition of the impairment of recognition by addition 
of context. The same was not true for the deletion of context, but the interpretation 
of the latter results is complicated by the use of strongly associated pairs. 

The demonstration of context effects is taken as evidence for the operation of 
retrieval processes in recognition. Recognition depends on access to the stored 
representation of the item as originally encoded. If at the time of test the encoding 
of the target differs substantially from that during input, the probability of access 
is correspondingly reduced. Thus retrieval would be involved since the “location” 
of the target item in memory must be found. The thrust of this argument is that 
words, qua strings of letters or sound sequences, do not occupy a fixed “location” 
in memory. (It is not clear whether this position has actually been advocated in an 
unqualified form by any two-stage theorist.) The effectiveness of contextual 
manipulations does not, however, seem to imply anything very definite about the 
immediacy or automaticity of access to stored representations of target items. If 
the encodings during input and at the time of test are sufficiently similar, no 
retrieval search may be necessary and access may be both immediate and auto- 
matic. In the everyday world one often experiences a sudden shock of recognition 
without previously engaging in retrieval operations. 

There have been several demonstrations that the recognition of homographs 
depends on the relation between the semantic contexts in which such polysemous 
words are presented during input and at the time of test. Changes in context 
designed to induce different interpretations of the homograph impair recognition 
(99, 105). After seeing “strawberry jam” in the input list it is harder to recognize 
“jam” when confronted with “traffic jam” on the test rather than with the repetition
of “strawberry jam” or “raspberry jam.” The only surprising finding is that
the effect of such manipulations is sometimes quite small, clearly implicating a 
context-free component of recognition learning (42). It has also been shown, 
perhaps not unexpectedly, that homographs studied as isolated words are more 
likely to be recognized when embedded in test sentences suggesting a common, 
rather than an uncommon, interpretation of the word (207). The implication of all 
these results is, of course, that homographs are encoded in a specific semantic 
sense. When “strawberry jam” is presented, the episodic trace contains informa- 
tion related to fruit preserves but not to traffic congestion. The problem would not 
arise were it not for the peculiar economy of English spelling. There should not
have been any doubt that semantic as well as orthographic and phonemic infor- 
mation is stored when a word is presented. In contrast to what is true of other 
words, there is no relation between alternative meanings of homographs. Thus this 
case is too atypical to permit any broad generalizations about the role of retrieval 
in recognition. 


IMAGERY 


No discussion of coding processes would be complete without a section on the role 
of imagery in learning and retention. Regrettably, we find little to report about 
broad new developments. The empirical evidence for the effectiveness of imaginal 
encoding was reviewed thoroughly in Paivio’s book (133). Since the appearance of 
the book, there has been a large number of new experiments demonstrating 
positive effects of rated imagery value and imagery instructions on performance in 
a great variety of tasks, ranging from the Brown-Peterson short-term memory 
situation to recall of meaningful sentences. There would be no point in enumer- 
ating them. Concreteness is better than abstractness and pictures are better than 
words. If one wants one good new reference for documenting these assertions, he 
can use the series of experiments reported by Paivio & Csapo (134). If anyone 
doubts that pictures are easy to remember, he should read a recent paper by 
Standing, entitled Learning 10,000 Pictures (174). This author concludes that the 
capacity of recognition memory for pictures has for all practical purposes no 
limits. It is also possible to report that, contrary to popular misconceptions, bizarre 
images and pictures are no better as memory aids than common ones (28, 124, 
210). 

Imagery value is found to have substantial effects in situations where one 
does not have very good reasons for expecting it. Verbal discrimination is 
a case in point. The effect has been repeatedly demonstrated, and an effort 
has been made to show that imagery exerts its influence independently of sit- 
uational frequency (151-153). It remains uncertain what the source of the 
imagery effect is. One possibility is that correct alternatives receive more implicit
rehearsals when they have high rather than low imagery value. Attempts to
establish a direct relation between imagery value and subjective frequency in the
context of verbal-discrimination studies led to inconclusive results (195). 

While empirical demonstrations of the apparent power of imaginal encodings 
continue, theoretical analyses of the underlying processes are inching forward 
slowly at best. Paivio still favors a dual-encoding interpretation, but evidence for it 
remains circumstantial (e.g. 134). Begg (15, 16) has been arguing that imaginal 
encoding favors unitization and thus results in a reduction of memory load. For 
example, concreteness of the component items favors the recall of phrases; more- 
over, the pairs are recalled in an all-or-none manner. Little attention is paid in 
such discussion to the problem of decoding. 

Increasing doubts are being expressed about the use of the concept of image as 
an explanatory construct. A thoughtful critique may be found in a recent article by 
Pylyshyn, who asks the question of “what the mind’s eye tells the mind’s brain”
(143). He argues that subjective experiences such as images may not play an 
essential causal role at all in psychological processes. “Cognition may be 
‘mediated’ by something quite different from either pictures or words, different in 
fact from anything that can be observed from either within or without” (p. 4). 
Some abstract underlying representation must be postulated since one can move 
easily back and forth between pictures and words. In a similar vein, Anderson & 
Bower (3) reject the “mental-picture metaphor” and argue for conceptualizing 
what is stored in propositional terms. Imagery is seen as helpful because it favors 
the encoding of propositions that are effective for storage. What is clearly needed in 
this area of research is a sustained fresh look at the complex problems of concep- 
tualization that arise in the interpretation of imagery effects. What is not needed is 
a continuing flow of demonstrations that performance is correlated with imagery 
value, and that instructions to use interacting images enhance recall. Since such 
investigations will undoubtedly continue to be carried out at a rapid rate, it is to be 
hoped at least that greater attention will be paid to potential confounding varia- 
bles such as the correlation between lexical complexity and abstractness (89). 
Galbraith & Underwood (51) reported recently that abstract words are perceived 
as having higher frequency and as occurring in a greater variety of contexts than 
concrete ones. It is far too early to take it for granted that the only important 
difference between concrete and abstract words is the ease of imaginal encoding. 


ORGANIZATION 


The ultimate sign of the success of a theoretical idea is that it comes to be taken for 
granted as part of the current body of knowledge in a discipline. This is what has 
happened to the concept of organization in recall, although some investigators still 
seem to find it useful to document it anew. There is no longer a sharp confronta- 
tion between the organizational and associationistic points of view. Transition 
experiments have shown that the operations defining the development of organi- 
zation and the establishment of associations shade into each other. Thus there are 
predictable transfer effects from free-recall to paired-associate learning which 
imply that the existence of higher-order units can significantly influence the 
development of associative relations between the members of such units (12, 13, 
81, 150). It has been shown that a program based on an associative model can 
simulate output organization with some success (1). 

There is still a great deal of exploration of the conditions conducive to the 
development of higher-order units and consistency of output order. The analysis is 
becoming increasingly fine-grained. Data keep coming in on the effectiveness of 
different kinds and numbers of categories (e.g. 14, 95) and the exact spacing of 
category instances within a list (19). The ways in which interresponse times reflect 
organization (85, 137) and the mechanisms of output interference in the recall of 
categorized lists (172) are being investigated. Studies such as these add useful 
information, but we do not discuss them in detail, and omit many others, because 
they do not seem to call for drastic changes in extant theoretical formulations. 
However, there have been some interesting theoretical developments in this area 
as well. We will consider two problem areas in which significant changes in the
interpretation of organizational processes have recently emerged. These concern 
the mechanism of part-to-whole and whole-to-part transfer in free recall and the 
issue of independent vs dependent item storage. 

The original demonstrations of negative part-to-whole and whole-to-part 
transfer were of considerable importance, not so much because the results were 
counterintuitive but because they gave support to a strong prediction derived from 
organization theory. The persistence of inappropriate higher-order units was 
shown to retard learning in the teeth of the advantage accruing from prior famil- 
iarization with the TBR items. It could also be inferred that for each list, short or 
long, there was an optimal organization that maximized speed of learning. Among 
the later findings serving to cast doubt on the organizational interpretation were 
the following: (a) informing the subject about the relation between the successive 
lists eliminated the apparent negative transfer effects (132); (b) there was positive 
rather than negative transfer when subjects were led to adopt a lenient criterion for 
responding (171); and (c) typical part-whole effects were obtained when successive 
lists were in fact identical (162). These results implicated difficulties of list dis- 
crimination rather than the carryover of inappropriate higher-order units as the 
source of negative transfer. That negative transfer occurs at the level of “list 
tagging” now appears to have been firmly established by Sternberg & Bower (175), 
who measured list identification as well as recall during the transfer phase. They 
show that their analysis can account satisfactorily for most of the existing transfer 
results.

A chronic thorn in the side of organization theorists has been Slamecka’s (169) 
finding that cuing subjects with words from within the list failed to enhance free 
recall. This finding led him to the conclusion that organization is not a character- 
istic of trace storage. Recall becomes organized only by virtue of the adoption of a 
retrieval plan based on the subject’s perception of the list. This argument was
pressed home when Slamecka (170) found that in the recall of categorized lists 
cuing by specific instances from within the list served to increase access to cate- 
gories but not the recall of items within a category. Thus multiple cues were no 
more effective than single ones. The conclusion was that there is no growth of 
associative linkages among individual items within higher-order units. Subse- 
quent work served to make the theoretical issue even more acute: a negative 
relation was demonstrated between the number of instances presented as cues and 
the probability of recall of the remaining items (149, 155). As a consequence, a 
drastically revised picture of the retrieval process in the recall of organized lists is 
now being proposed. The essence of the argument is that the effective associations 
are not among the individual items but rather between the items and a retrieval 
cue (155) or control element (109, 149). The higher-order unit is seen as being 
organized hierarchically. Access to individual items is not horizontal but rather 
through the control element. Cuing by individual instances has a negative effect 
because the strength of the association of the presented items to the control 
element is increased by virtue of their presentation, and the accessibility of other 
items is correspondingly reduced. It is too early to evaluate the promise of this new 
theoretical formulation which does indeed represent a radical departure from
established principles and is counter-intuitive in denying the operation of inter- 
item associations in recall. An early question to consider is how applicable this 
interpretative schema is to the recall of noncategorized lists. There is an obvious 
risk that principles derived from results obtained with categorized lists may prove 
to have limited generality. 


ATTRIBUTES OF MEMORY 


One of the most important new developments in our field is the steady growth of 
interest in the multiplicity of attributes that constitute the memory of an event. We 
remember not only what occurred, but also when, where, how frequently, and so
on. This conception of memory was set forth in a seminal paper by Underwood 
(184), which also introduced the distinction between two major classes of attri- 
butes—those serving a discriminative function on the one hand and those instru- 
mental in retrieval on the other. Research on attributes has been directed toward 
two general objectives: (a) to explore the range of attributes that are remembered 
and thereby to gain some understanding of the characteristics of the memory 
trace; (b) to determine by experimental analysis the function of specific attributes 
in the processes of remembering. We will illustrate these two approaches with the 
aid of some representative studies. 


Repetition and the Representation of Events in Memory 


How is the repeated experience of the same nominal event recorded in memory? 
The historical answer was, of course, that the trace of that event was strengthened. 
The results obtained in a programmatic series of studies by Hintzman and his 
associates on memory for attributes have provided decisive evidence against this
conception. Some of the basic findings are these: (a) Subjects can judge with some
accuracy not only the frequency with which items have been repeated within a list 
but also whether the repetitions were massed or spaced (67). (b) When a word is 
presented twice within a list, subjects retain information about the serial positions 
in which the repetitions occurred; moreover, the judgments of the two serial 
positions are essentially independent (68). (c) When repetitions of a word are 
distributed over two lists, the subjects retain information about the separate 
frequencies of occurrence in each of the target lists, although there is also some 
effect of overall frequency (68). (d) The identification of the list membership of an 
item is mediated by both frequency and recency information (70). Results such as 
these show clearly that information about the time of occurrence and the spacing 
of repeated events is retained in memory; information about spacing may be 
derived from implicit judgments of recency (69). It is apparent that these discrim- 
inations could not be based on the changes in strength of a single trace which is 
incremented by each repetition; the facts are more consistent with a multiple-trace 
hypothesis which holds that repeated occurrences of an event produce multiple 
representations in memory that preserve information about the temporal order of 
repetitions (cf 74). 
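
The contrast between the strength and multiple-trace hypotheses can be made
concrete with a small sketch. Everything here is illustrative: the Trace
record, its fields, and the helper functions are hypothetical names chosen for
this example and are not taken from the studies cited. The point is only that
a single strength counter discards the list and position information that
subjects demonstrably retain, whereas separate traces preserve it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trace:
    """One stored occurrence of an item (hypothetical record)."""
    item: str
    list_id: int
    position: int


def strength_store(events):
    """Strength hypothesis: each repetition increments a single value."""
    strength = {}
    for item, _list_id, _position in events:
        strength[item] = strength.get(item, 0) + 1
    return strength


def multiple_trace_store(events):
    """Multiple-trace hypothesis: each occurrence leaves its own record."""
    return [Trace(item, list_id, position) for item, list_id, position in events]


def frequency_by_list(traces, item):
    """Separate frequencies of an item in each list."""
    counts = {}
    for t in traces:
        if t.item == item:
            counts[t.list_id] = counts.get(t.list_id, 0) + 1
    return counts


def spacing(traces, item, list_id):
    """Gaps between successive positions of an item within one list."""
    positions = sorted(
        t.position for t in traces if t.item == item and t.list_id == list_id
    )
    return [later - earlier for earlier, later in zip(positions, positions[1:])]


# (item, list, serial position): "dog" occurs twice in list 1, once in list 2.
events = [("dog", 1, 2), ("dog", 1, 3), ("dog", 2, 5), ("cat", 1, 7)]
traces = multiple_trace_store(events)
print(strength_store(events))
print(frequency_by_list(traces, "dog"))
print(spacing(traces, "dog", 1))
```

The strength store can report only that "dog" occurred three times; the trace
store can also answer how often it occurred in each list and whether the
repetitions were adjacent.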


Frequency Discrimination and Recognition


In a programmatic series of investigations, Underwood and his associates have 
accumulated an impressive body of data in support of the hypothesis that fre- 
quency discrimination is a basic mechanism of recognition. Information is stored 
about both the background frequency and the situational frequency of events. The 
former represents the record of long-term cumulative experience and the latter the 
number of occurrences in a specific context such as an experimental situation. (One 
may note a parallel here to the distinction between semantic and episodic mem- 
ory.) In experiments on recognition, the discrimination between situational 
frequencies of 0 (distractors) and 1 (targets) is sufficient for a correct decision. The 
fact that accurate relative judgments of situational frequency can be made even 
when distractors have higher background frequencies than targets argues directly 
against trace strength as the basis of the discrimination and in favor of viewing 
frequency as a distinctive attribute of memory. Perhaps the most direct demon- 
stration of the dependence of recognition on frequency discrimination is the close 
correspondence between frequency judgments and recognition decisions follow- 
ing a given verbal input (188). Absolute and relative frequency judgments are 
highly correlated and both predict the accuracy of recognition (194). 

A number of interesting predictions have been derived from frequency theory 
and confirmed experimentally. One of these concerns the development of syste- 
matic changes in performance in the course of a recognition test. When the same 
distractor is used repeatedly with different targets, recognition accuracy declines 
progressively because the distractors gain situational frequency; correspondingly, 
performance improves when the same target is presented repeatedly, each time 
with a different distractor (191). While these results provided impressive support 
for the theory, they also made it clear that attributes other than frequency can 
serve to mediate recognition: performance failed to decline to a chance level when 
the situational frequency of the distractors exceeded that of the targets. Other 
factors, such as the semantic characteristics of individual words (cf 189), un- 
doubtedly influence recognition decisions. However, it is also a fact that the 
learner is remarkably sensitive to variations in situational frequency. For example, 
the frequency that accrues to the syllables of input words is reflected in the error 
probabilities when the syllables are subsequently combined into new words on the
recognition test (193). But there are also important limitations on the individual’s
ability to store and utilize information about the situational frequency of specific 
items. Thus the presence of conceptually related items in a list leads to the 
overestimation of the frequencies of individual words, probably because the 
subject is unable to differentiate between presented frequencies and those pro- 
duced by displaced rehearsals (163). 
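
The prediction about repeated distractors can be illustrated with a minimal
sketch of a pure frequency rule. Two assumptions are made here that go beyond
the text: each appearance of an item on the test adds exactly one unit of
situational frequency, and the subject always chooses the alternative with
the higher count. The function names and word lists are hypothetical.

```python
from collections import Counter


def choose_by_frequency(pair, freq):
    """Return the member of a test pair with the higher situational frequency.

    Returns None on a tie, where a pure frequency rule cannot decide.
    """
    first, second = pair
    if freq[first] == freq[second]:
        return None
    return first if freq[first] > freq[second] else second


def run_test(study_list, test_pairs):
    """Run a two-alternative recognition test under the frequency rule."""
    freq = Counter(study_list)  # situational frequency after the study phase
    choices = []
    for pair in test_pairs:
        choices.append(choose_by_frequency(pair, freq))
        # Assumption: every item shown on the test gains one unit of frequency.
        for item in pair:
            freq[item] += 1
    return choices


# Same distractor ("zebra") paired with a different target on each trial.
print(run_test(
    ["table", "river", "cloud"],
    [("table", "zebra"), ("river", "zebra"), ("cloud", "zebra")],
))

# Same target ("table") paired with a different distractor on each trial.
print(run_test(["table"], [("table", "lamp"), ("table", "fork")]))
```

Under these assumptions the repeated distractor first ties with and then
overtakes the targets, so a subject relying on frequency alone would begin to
err, while a repeated target only widens its margin. That observed
performance does not fall to chance is what implicates attributes other than
frequency.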

The frequency theory of verbal discrimination learning continues to receive 
impressive empirical support. Study after study shows that performance in a verbal 
discrimination task reflects directly the relative frequencies that accrue, through 
either overt or covert processing, to the correct and incorrect alternatives. Other 
factors undoubtedly play a part; as in the case of recognition, performance often
fails to deteriorate as severely as it should when discrimination on the basis of
frequency can be assumed to be difficult or impossible (e.g. 192). What these other 
factors might be remains uncertain. The hypothesis that associations incidentally 
formed between the members of pairs play a significant causal role in verbal 
discrimination learning (50) can now be ruled out on the basis of a number of 
critical tests (214).

Perhaps the most urgent problem facing frequency theory is to account for the 
loss of frequency information over time. The losses are extensive, but how do they 
come about? A promising hypothesis was that there is a progressive assimilation of 
situational frequency to background frequency, but this interpretation has re- 
ceived only partial empirical support. Retroactive and proactive inhibition occur, 
but the observed functional relations do not always conform to the predictions of 
frequency theory (190, 194). If these difficulties can be resolved, we may gain a 
new purchase not only on the forgetting of frequency information but also on the 
conditions determining the loss over time of other attributes of memory. 


SOME IMPORTANT OMISSIONS FROM THIS REVIEW 


This review does not contain a section on interference theory and forgetting. 
Interference theory today is in a state of ferment if not disarray. There have been 
powerful challenges to traditional formulations (61, 107), and responses to these 
challenges, but the critical experiments that will permit a decision between the 
contending theories remain to be performed. There is no lack of new data on 
retroaction and proaction, but so far they have failed to resolve the basic theore- 
tical issues. The literature is both voluminous and technical. Since it has been 
reviewed elsewhere, it seemed reasonable to omit it from this review. We have also 
not discussed the recent work on intentional or directed forgetting. Excellent 
summaries of this literature are available (18, 46). While there are specific exper- 
imental operations that define directed forgetting, it appears to us that the concept 
is an essentially hybrid one. The results of the experiments have given us valuable 
information about the effects of selective rehearsal, the functional grouping of 
items in memory, and the consequences of selective memory search. But they tell 
us little about forgetting in the basic sense—why information that has been ac- 
quired and used is subsequently lost. Many other important topics have not been 
mentioned lest one’s limited-capacity processor be strained unduly. 


The scope and volume of the literature on both counseling and student
development have increased dramatically over the past several years. The literature
on both subjects has diverged to the point where we have recommended to the 
editorial board of the Annual Review of Psychology that the topics be separated in 
future issues. We have chosen to separate them in our presentation. 

It should be underscored that space was limited and that the intent of the
editor’s charge to us was “to present a broad view of the progress, direction, and
purpose of current research...” in the context of a “selective—not comprehen- 
sive—review of the literature, and a critical analysis of the present state of the sub- 
ject assigned.” 

Our initial draft was twice the length of our allotted pages. At that we did not 
feel that we had done the subjects justice. It must be recognized that it is possible, 
and indeed it has been done, to write books on much less subject matter than is 
covered here. 


THE COUNSELING LITERATURE 


This has been an extraordinary period in the development of the counseling 
literature. The initial research forays, particularly in client-centered and behav- 
ioral counseling in the 1950s and 1960s, have resulted in the development of a 
range of valid and reliable research instruments which have made possible much 
productive basic research. 

On a number of topics it has been necessary to cite findings from literature prior 
to 1971, because they are necessary to understanding the growth of our knowledge, 
particularly as they relate to the problems of replication, therapist-offered condi-
tions such as empathy, the use of confrontation to promote client self-exploration, 
counselor selection, criterion of outcome issues, and the role of self-disclosure for 
both counselor and client in effecting change as a result of the counseling process.

We have tried to strike a balance between alerting readers to important areas 
of inquiry by citing significant research results, and directing attention to meth- 
odological issues which can assist future researchers in further improving the pow- 
er of our counseling research tools. Lastly, we have attempted to identify certain 
areas which merit intensive research activity in the immediate future. 


Empathy and Other Therapist-Offered Conditions 


Meltzoff & Kornreich (105) critiqued the research done prior to 1970 on empathy 
and other, what they termed, “therapist-offered conditions.” Their concerns pro- 
vide a useful baseline for analysis of the studies since then. A first concern is the 
limitation inherent in studying therapist-offered conditions, as that topic fails to 
consider the interaction between counselor and client, and any relevant 
client-offered conditions. 

A second concern identified by Meltzoff & Kornreich (105) is that in most 
process studies the various facilitative variables are: “... related only to the 
patient’s self-exploration. The implicit, and sometimes explicit, argument is that 
the facilitative conditions lead to self-exploration, which in turn leads to favorable 
outcome. Both hypotheses remain to be firmly established” (p. 402). There are, of 
course, major differences of opinion about whether the hypotheses remain to be 
established. Patterson (117), Carkhuff (30), Carkhuff & Berenson (38), Truax & 
Carkhuff (135) and Truax & Mitchell (137) offer considerable evidence in support 
of the relationship between what Meltzoff & Kornreich (105) label therapist- 
offered conditions and client change. 

A third concern of Meltzoff & Kornreich (105) is that it would be a mistake “to 
freeze therapeutic research in process into the current versions of these facilitative 
conditions when the results are ambiguous and much remains to be explored” 
(p. 402). This concern appeared to be based on indications from research that 
self-exploration may not be the only vehicle to change, or may not lead to change, 
that the therapeutic conditions may be accounted for by one major variable, or 
that some simpler, specifiable therapist behaviors may underlie these conditions. 

With the concerns of Meltzoff & Kornreich (105) as a preamble, the literature 
on empathy from 1970 through March 1974 is not likely to produce converts to the 
idea that the basic research problems have been solved. 

Heck & Davis (68), using a counseling analogue format, investigated the nature 
of the interaction between counselor and client as it contributes to the differential 
level at which empathy is offered. As such, it is research that directly investigates 
the first problem area identified by Meltzoff & Kornreich (105)—namely, the 
interaction between counselor and client as that interaction affects the behavior of 
the counselor. 

In this study, Heck & Davis (68) were interested in whether certain counselor- 
client characteristics interact and produce an effect upon the counselor’s level of 
expressed empathy. They report that the level at which empathy is expressed is 
affected by differences in clients. Another finding of the study bears reporting; 
namely, evidence supporting the contention “derived from conceptual system 
theory that counselors who differ in conceptual level have different base levels of 
empathy” (p. 103). This may indicate one source of the differences between people 
in their capacity to offer an empathic response. Further research ought to be 
undertaken to explore the implications of this and its potential utility as a selection 
device for counselor training. 

Gurman’s (66) study of the therapeutic conditions of empathy, warmth, and 
genuineness within each session by an individual therapist clearly demon- 
strates the increasing sophistication of the current investigations in the area of 
therapist-offered conditions. 

This is an important area of inquiry since implicit in prior research models 
(Wogan 147) has been the assumption that these helping variables were a constant 
for a therapist at some fixed level of facilitativeness. Rogers et al (123, p. 11), for 
example, had taken the position that the helping variables reflected “underlying 
attitudes” or “basic feelings.” 

A series of earlier investigators (Kiesler, Mathieu & Klein 92; Moos & Macin- 
tosh 109; Vesprani 139) starting with Truax (133) began to question the stability of 
these therapist variables. Wogan (147) further questioned the reliability of the 
usual measure of empathy, noting that “since most research uses single measures 
of empathy, taken at one point in time, there are questions of the reliability of 
empathy itself” (p. 112). 

The purpose of Gurman’s (66) study was to assess the “consistency of thera- 
peutic functioning within and across sessions with repeated measures of facilita- 
tiveness within the same therapist-patient dyads” (p. 16). Gurman employed 
“systematically selected therapists in a multiple, single case format,” utilizing a 
research approach presented by Bergin & Strupp (20), Chassan (40), and David- 
son & Costello (49). Therapist functioning was rated on the Bergin & Solomon 
(19) revision of the Truax Accurate Empathy Scale and on the Truax scales of 
Nonpossessive Warmth and Genuineness (Truax & Carkhuff 135). 

Two significant findings resulted from this study. First, both high- and low- 
functioning therapists vary in their functioning, both within and across sessions. 
Second, therapeutic conditions tend to reach peak levels within sessions in the 
mid-late and late segments of the therapy hour. 

This study directly challenges the usual methodology in studies of therapeutic 
facilitativeness with respect to sampling of therapist functioning for rating pur- 
poses. It gives added impetus to Kiesler’s (90) point that determination of which 
segments to sample ought to be based on the theory from which the therapist is 
rating. Further, the Mintz & Luborsky (106) finding that ratings based on brief 
segments could not be considered as accurate approximations to session-based 
judgments deserves even more careful attention from researchers in light of this 
finding. 

A final issue raised by Gurman (66) is the question of whether all therapist 
responses are of equal importance in therapy. Truax & Carkhuff (135) had earlier 
questioned whether or not peak levels of functioning might not be more predictive 
of outcome than overall level. Gurman asserts quite persuasively that this area 
represents an important one for future inquiry. 

Grief & Hogan (65) presented a paper on the theory and measurement of 
empathy from quite a different perspective than the client-centered tradition. 
Their work begins by indicating that empathy refers to “a sensitivity to the needs 
and values of others,” and identifies it as a “major element in role-theoretical 
accounts of interpersonal behavior.” The role-theoretical literature was noted by 
the authors to include such references as Cottrell (45), Goffman (61), Kelly (89), 
McDougall (102), Mead (103), and Sarbin & Allen (125). 

The work of Grief & Hogan (65), based on a scale derived from the California 
Psychological Inventory and the Minnesota Multiphasic Personality Inventory, adds 
a potentially different definition of the term empathy to the literature. This 
brings to mind the research questioning the meaning of accurate empathy ratings 
(Chinsky & Rappaport 43; Kiesler, Mathieu & Klein 92). 

A second thread of literature introduced by Grief & Hogan focused on the 
relationship of the concept of empathy to understanding moral development and 
moral conduct. Cited here are such references as Baier (10), Hogan (76), and 
Wright (149). What is particularly interesting about this set of literature citations is 
that there exists an entirely different set of researchers (Mosher & Sprinthall 111) 
independently drawing upon the relationship of empathy to moral development. 
Mosher & Sprinthall (111) were teaching counseling empathy skills to high school 
students in order to promote moral development as defined by Kohlberg (94-97). 

Both Grief & Hogan (65) and Mosher & Sprinthall (111) appear to the writers of 
this review article to be investigating one of the frontier areas in counseling 
research. The fact that they began publishing about the same time from such 
different schools of thought and methodology reflects both the nature of interdis- 
ciplinary scientific inquiry and the fact that the counseling literature is suffering 
from an information retrieval problem of the first magnitude, particularly in the 
area of the applicability of work from other disciplines. 


Self-Disclosure 


The passage of time has allowed systematic investigation to broaden our knowl- 
edge in important ways. Our increased understanding of self-disclosure is a case 
in point. Jourard’s (82) and Mowrer’s (112) initial work, further developed by 
Carkhuff & Berenson (38), Truax & Carkhuff (135), Brahan (25), Strong & 
Schmidt (132), Schmidt & Strong (128), and Strong & Dixon (131), has laid the 
groundwork for detailed research to the point that Murphy & Strong (113) were 
able to demonstrate the influence of self-disclosure in helping another person feel 
warmth, friendliness, and a sense of being understood, and the significance of the timing of a 
self-disclosure. Giannandrea & Murphy (60), in an extension of the Murphy & 
Strong (113) work, investigated the relationship between frequency of similarity 
self-disclosure and return for a second interview. A curvilinear relationship was 
found in which more students returned for a second interview after receiving a 
moderate number of disclosures than after receiving few or many disclosures. 
Heilbrun (75) investigated the relationship of difficulty in self-disclosure 
to early termination in therapy. This research brings together several different 
threads of investigation, including role expectancy theory (69-74) and the 
self-disclosure of Jourard (82). The hypothesis that females who terminate 
prematurely from therapy are those who have greater difficulty engaging in 
self-disclosure, especially with males, was tested. He found the opposite to be the 
case. Probable terminators were higher self-disclosers than probable continuers. 
All females, however, disclosed more to females than to males. The use of role- 
expectancy theory was helpful in his interpretation of the results, particularly in 
suggesting implications for further research. 

Weigel et al (143) reported that therapist self-disclosure may lead to client 
opinions of unprofessional conduct or questions about the therapist’s mental 
health. While a therapist may be better liked by group members if he is self- 
disclosing, Weigel & Warnath (144) found the therapist may be judged as less 
healthy mentally. In a related study, Truax, Carkhuff & Kodman (136) found that 
deviation from a “professional role” through self-revelation may lead to being 
viewed as less helpful. 

The notion of what is expected therapist conduct is, of course, central. It is 
beyond the scope of this paper to review the many differing conceptions. Suffice it 
to say that the few previous studies (Culbert 47, Bolman 23, Kangas 86) which have 
investigated the effects of therapist self-disclosure do not provide any definitive 
conclusions. 

Dies (51) continued this line of inquiry by getting clients from ten different 
therapy groups to evaluate their leaders on the Group Therapist Orientation Scale 
(50), which assesses opinions about the appropriateness of therapist self- 
disclosure. Clients were also asked to evaluate their therapists on a set of 20 
bipolar adjectives designed to obtain information on therapist likability, perceived 
levels of helpfulness, and emotional stability. The study found that 
self-disclosing therapists were judged by their clients as more friendly, disclosing, 
trusting, intimate, helpful, and facilitating (Factor 1), yet also as being less relaxed, 
strong, stable, and sensitive (Factor 2). A related finding was that the longer a 
client was in therapy, the more favorable were his attitudes likely to be toward 
therapist self-disclosure. 

May & Thompson (100), in a study related to the Dies (51) work, investigated 
further the relationship between therapist self-disclosure, mental health, and 
helpfulness as perceived by group members. Another part of this work was 
whether the group member’s opinion of therapist self-disclosure and mental 
health was related to the member’s own self-disclosure level and mental health. 
Significant positive correlations were found among the three therapist dimensions 
of self-disclosure, helpfulness, and mental health. No significant differences were 
found in the ratings of group leaders by those group members ranked highest and 
those ranked lowest on the member variables of self-disclosure and mental health. 
May & Thompson provide a thoughtful discussion of possible reasons why the 
findings from their study are at variance with those of Weigel et al (143). Since 
Dies (51) appeared in the same journal issue as did May & Thompson (100), 
neither was able to discuss the other’s results. There is much yet to clarify in this 
area of inquiry. Further research is essential. 


Confrontation and Self-Exploration 


The work on confrontation is another example of progressive, systematic research 
which has advanced our knowledge of key counseling concepts. Berenson and 
co-workers (16, 17), in their investigations of characteristics of high and low 
facilitators, found that high facilitators more frequently used confrontation. An- 
derson (2) found that within the first therapy hour client self-exploration was 
influenced by confrontation in a differential manner depending on whether the 
counselor was a high or a low facilitator. Confrontation was related to increased 
client self-exploration when accompanied by high levels of the facilitative condi- 
tions. Under low levels of facilitative conditions, confrontation was not followed 
by increased self-exploration. In further research, Anderson (3) used an analysis of 
variance design to compare the effect of therapist level, client level, confrontation 
type, and whether it was the first or fourth interview. She found three significant 
results: (a) confrontations were followed by increased client self-exploration to 
a significantly greater degree from high-functioning therapists than from low- 
functioning therapists; (b) the fourth interview as compared to the first pro- 
duced greater variability in client depth of self-exploration; (c) greater variability 
in type of confrontation was found in high-functioning therapists. 

Mitchell & Namanek (108) undertook a replication of the Anderson (2) study as 
well as a considerable extension of confrontation research. They investigated the 
effects of therapist confrontation on subsequent client and therapist behavior 
within the first therapy interview. Importantly, Anderson’s (2) findings were not 
replicated. No mention was made of the related Anderson (3) study. 

Confrontation was found by Mitchell & Namanek (108) to have had little 
overall impact on client self-exploration within the first session of therapy. A 
number of intriguing avenues for further research were outlined, including the 
study of the effects of confrontation over the entire course of psychotherapy. 

Kaul, Kaul & Bednar (87) investigated the response styles of eight counselors, 
all functioning at or above minimally facilitative levels of therapeutic core 
conditions. These counselors were classified as confrontative or speculative in 
order to study the effects of response style upon client self-exploration. Estimates 
were made by the counselors, the clients, and by trained raters of client depth of 
self-exploration. In summarizing the previous literature,* Kaul, Kaul & Bednar 
(87) noted that an implication from previous studies is that counselor confrontation 
leads to increased self-exploration by clients, “at least when the counselor is 
judged to be performing at minimally facilitative levels of therapeutic core conditions” 
(p. 132). Such an implication they believe to be tenuous, since “the 
technique of confrontation has been confounded with therapeutic levels.” 
Another research problem they note is that the consequences of confrontations 
have been assessed immediately following the technique, and that no compelling 
evidence exists to support the belief that “the consequences of confrontations will 
manifest themselves immediately.” 

* A problem in the Kaul, Kaul & Bednar literature review is that the Mitchell & Namanek 
paper (108) had not appeared at the time they submitted their review. Mitchell & Namanek 
(108) importantly failed to replicate Anderson (2), a fact Kaul, Kaul & Bednar could not 
have known. They also failed to review Berenson, Mitchell & Laney (16), and Mitchell & 
Berenson (107). 

Results of the research by Kaul, Kaul & Bednar (87) were as follows: 1. Con- 
frontative counselors perceived greater self-exploration in their clients than did 
speculative counselors, but clients perceived greater self-exploration than did 
counselors or judges. 2. Clients of confrontative and speculative counselors did not 
differ in their perceptions of their self-exploration. 3. Importantly, trained raters 
did not differ in their perceptions of self-exploration between clients of confrontative 
or speculative counselors. 

There certainly was not support for the general hypothesis that counselor 
confrontation leads to client self-exploration. Because of certain methodological 
differences, the authors caution that their results “cannot be interpreted as com- 
pletely contradictory to the results of earlier studies.” They do note that the fact 
trained raters did not differ in their assessment of self-exploration “across con- 
frontive and speculative response styles contradicts Anderson’s (2, 3) data” (87, p. 
135). 

The explanation offered by Kaul, Kaul & Bednar (87) is that: 


... both groups of counselors were functioning at high levels of facilitative conditions 
and both were actively engaged in productive, therapeutic activities. The implication is 
that client self-exploration is more a function of the counselor’s overall effectiveness 
than of the specific modality through which it may be expressed (p. 135). 


The authors note that investigating the above line of reasoning represents a 
potentially fruitful area for further research. 

The studies on confrontation cited above, taken together, are important in that 
they indicate the extraordinary complexity of research on counseling variables, 
the difficulty inherent in the isolation of the effect of a single therapist-offered 
condition imbedded in overall therapist behavior, the necessity of careful repli- 
cation, the appraising of all the relevant literature, and the benefits in increased 
understanding which accrue from systematic investigation over a period of time. 
While confrontation as a counselor skill may have promise, its effects on clients 
over the course of counseling have barely begun to be studied. 


Training in Basic Human Helping Skills 


The last decade, and particularly the period of time covered by this review, has 
marked a major advance in the methodology of training counselors. Adopting 
very specific, observable skills as the basis, a number of innovators have developed 
training packages which are now receiving research scrutiny. A partial listing of 
these training innovators would include, along with their associates, Carkhuff (30, 
32-36), Kagan (83-85), Hosford (80), Ivey (81), Gazda (59), Danish & Hauer (48), 
and Brammer (26). It is beyond the scope of this review to undertake a systematic 
evaluation of these various approaches to teaching human helping skills. The 
relative newness has precluded the development of an empirical assessment of 
their comparative effectiveness, though there has been considerable research done 
within each system. Given the importance to society of developing potent methods 
for teaching human helping skills which will benefit both counselors in training 
and members of the general public who wish to be paraprofessionals, this area 
must be assigned priority for investigation in the immediate future. 

Several research studies which have investigated elements of training deserve 
mention. Fry (58) found that pairing helping skills training with desensitization 
improves post-training ratings of the trainee’s helping function. 

Butler & Hansen (28) investigated two important questions in training: whether 
persons rated prior to training as being low or moderate in level of facilitation 
experience differential effects from counselor training, and whether the post- 
training level of each group persists over time. The authors found that levels of 
facilitation can be increased for both groups, and that these levels are maintained 
over a 4-week post-training period. The results suggested that moderately functioning 
persons benefited more from training, though this finding was not definitive. 

Saltmarsh (124) and Birk (22) both presented approaches for assisting counse- 
lors in developing empathy skills. Saltmarsh’s approach was to utilize programmed 
instruction consisting of printed materials and tape-directed interaction focus- 
ing on each of the five identified components which he assumed to be supportive 
of empathy. The evidence for the validity of the components was not presented. 
He did find significant differences between the experimental group which had his 
training program and a control group which had discussions about “relationship 
dynamics” on the Kagan et al (85) Michigan State Affective Sensitivity Scale, 
which is a valid and reliable instrument for measuring affective sensitivity. A 
weakness in this particular study which the author noted was that the experimental 
group focused intensely on empathy while the control group did not. The specific 
defects of this study are correctable in future research. The idea of using alternate 
approaches such as programmed instruction to teaching empathy is worthy of 
further study. 

Birk (22) investigated the effects of a didactic and an experiential counseling 
supervision method in developing empathy. She found that supervisee 
preference for a particular supervision method had no effect. With respect to which type of 
supervision was more effective in developing empathy, her findings supported the 
didactic over the experiential, as did Payne & Gralinski’s (119). 


Selection of Counselors and the Criterion Problem 


Counselor selection studies constitute a research area where little progress has 
been made, in contrast with developments in training research. A central issue has 
been the definitions of effectiveness and ineffectiveness employed in labeling 
subject groups and their relation to an outcome criterion for client behavior. In 
reviewing developments in thinking and research regarding the criterion problem, 
it was considered useful to draw on psychotherapy as well as counseling literature 
to provide a complete examination of the issues. 

In the last 7 years, three reviews of the literature on counselor selection—Pat- 
terson (116), Allen & Whiteley (1) and Whiteley (146)—have agreed specifically on 
two criticisms of predictive studies: the inadequate definition of the criterion, and 
the lack of a theoretical rationale for selection of variables. The two are obviously 
interrelated. 

Allen & Whiteley’s (1) analysis of the difficulties in obtaining predictive validities 
in the studies they reviewed is that researchers proceeded on the assumption 
that since personality differences exist between “good” and “bad” counselors, all 
one had to do is “administer the standard personality tests and tabulate the 
differences” (p. 7). They further observed that no existing test of personality can 
carry the measurement and conceptualization of the whole personality, and that it 
is unlikely that any “specifiable aggregates of individual personality traits” will be 
found to discriminate between effective and ineffective counselors. Clearly, the 
direction of theory is required. 

The relevance to counselor behavior of personality measurements developed 
for assessment of psychopathology in patient populations must also be seriously 
questioned. The use of these and other standardized measurements in counselor 
selection research “implicitly makes,” as Meltzoff & Kornreich note in their 1970 
review, “their authors usurp the place of the researcher as the creator of hypotheses 
for his experiment” (105, p. 310). The record of research in this area indicates 
a willingness on the part of many investigators to relinquish this function, in the 
absence of viable theoretical models, perhaps in the hope of discovering by 
coincidence the “right fish” with which to begin to stock one’s own pond. 

The quest for a theoretical formulation takes us back to the criterion problem. 
Whiteley (146) has argued that “By assessing what the counselor does (his actual 
behavior), criteria of competence can be developed.... by attempting first to 
specify the counselor behaviors which are associated with client change, it be- 
comes possible to relate these behaviors to higher-order personality factors within 
a systematic rationale” (pp. 175-76). In summary, before any progress can be 
made in a theoretical formulation of the personality dimensions related to effectiveness 
in counseling, more rigorous consideration must be given to the definition 
of the criterion (effectiveness), and that definition must be in terms of specifiable 
counseling behaviors which are clearly and predictably associated with client 
change. 


THE CRITERION PROBLEM—PSYCHOTHERAPY: EFFECTIVE OR INEFFECTIVE? The 
publication in 1952 of the first of a series of papers by Hans Eysenck, questioning 
the efficacy of traditional psychotherapy, brought forth a flurry of research activity 
to demonstrate the effects of psychotherapy and a refocus and reevaluation of the 
investigative models employed in studying the process and outcome of therapy. 
Eysenck’s (53) challenge spearheaded the impact of behavioral science on the 
theory and practice of psychotherapy, particularly on the criteria for specifying 
outcome and measures of change. At the same time there was a marked increase in 
preference for behavioral models of therapy and their more operationally speci- 
fiable treatment and outcome variables. While some were turning to learning 
theory for techniques demonstrably potent in changing behavior, many others 
renewed their efforts to validate the more traditional forms of psychotherapy. 

Eysenck’s critique continues to be the subject of debate regarding assessment of 
outcome, and every review of the literature in this area begins with his survey, 
although Bergin (18), in a refreshing departure, remarks, “... even though some 
people like myself are still writing about it, I, at least, know that I am writing 
history, not news” (p. 218). 

Bergin (18), in his review of Eysenck’s work and the ensuing 20 years of 
controversy and productivity it provoked, concludes that the bases for the issues 
raised by Eysenck have quietly dissipated and that there is evidence of significant 
change in clients receiving the more traditional forms of psychotherapy. 

In a review of 48 recent studies of outcome, Bergin (18) demonstrated a slight 
positive relationship between more rigorous design and more positive outcome. 
Bergin examined these studies for, among other things, thoroughness and ade- 
quacy of design. He considers his evidence conclusive enough to “dispel the notion 
that experimental rigor washes out significance of results in tests of 
effectiveness” (p. 238). 

Meltzoff (104) and Meltzoff & Kornreich (105), in their thorough reviews of the 
research literature, assert definitively that the evidence for the effectiveness of 
psychotherapy is in. Among the 101 studies they examined, 56 percent (57 studies) 
were considered “sufficiently adequate in design and execution for valid conclu- 
sions to be drawn.” Of these, 84 percent (48 studies) yielded positive outcomes of 
psychotherapy. 
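
As an arithmetic check on the figures just cited, the following minimal Python sketch recomputes the two percentages from the study counts given in the text. The variable and function names are illustrative assumptions and do not come from Meltzoff & Kornreich.

```python
# Counts as reported in the text for the Meltzoff & Kornreich (105) review.
total_studies = 101          # studies examined
adequate_studies = 57        # judged adequate in design and execution
positive_adequate = 48       # adequate studies with positive outcomes


def percent(part: int, whole: int) -> int:
    """Return part/whole as a whole-number percentage (hypothetical helper)."""
    return round(100 * part / whole)


adequate_pct = percent(adequate_studies, total_studies)
positive_pct = percent(positive_adequate, adequate_studies)

print(f"Adequate design: {adequate_pct} percent of all studies")
print(f"Positive outcome: {positive_pct} percent of adequate studies")
```

Note that the second percentage is taken over the 57 adequate studies, not over all 101.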

Meltzoff & Kornreich (105) attribute the differences between the conclusions 
arrived at in their review and earlier ones (Eysenck 53-55, Truax & Carkhuff 135) 
to the greater scope and comprehensiveness of the literature they reviewed, as well 
as to the use of stringent criteria for assessing the adequacy of design of studies 
reviewed. They selected material sufficiently inclusive in scope to investigate 
adequately the issues central to the question of effectiveness (which it can be 
argued had never been done prior to these reviews and Bergin’s which followed 
shortly after). By assigning differential values to the evidence in terms of design 
adequacy, they were able to conclude, in agreement with Bergin, that “the better 
the quality of research, the more positive the results obtained” (105, p. 177). 

While the studies reviewed by Bergin (18), Meltzoff & Kornreich (105), and 
Kellner (88) represent significant improvement in design and in the specificity of 
the criterion, the criterion itself remains a problem in interpretation. The use of 
subjective ratings by therapists or patients as criterion measures has declined 
considerably. Meltzoff & Kornreich (105) found only 6 to 7 percent of all studies, 
regardless of adequacy of design, were using these kinds of ratings. Rather, 
observed behavior, personality inventories, and rated behavior were more uti- 
lized, often in combination, and were particularly prevalent in studies which the 
authors rated as adequate in design. 

However, these less subjective criteria are still sources of difficulty in interpretation 
due to the differences among various theoretical systems in assumptions 
about what constitutes change and the valence of its direction. More importantly, 
these kinds of outcome criteria have not been more efficacious than the more 
subjective ones in the task of identifying the therapist behaviors relevant to them. 
Kiesler (91), in his analysis of what he has labeled the “uniformity myth,” i.e. the 
“quest for the one system of therapist behavior that would produce constructive 
personality change for patients” argues that “our psychotherapy research designs 
can no longer incorporate these uniformity myths” (p. 40). 

The uniformity assumption is that clients on the one hand and therapists on the 
other represent “uniform, homogenous groups, more alike than different,” and 
that outcome represents “a uniform and homogenous patient-change dimen- 
sion... that would reflect for any patient the effects of psychotherapy” (p. 40). 

Increasingly the argument is being made for specific rather than global criteria 
for client change in outcome studies. Individually specified criteria for construc- 
tive personality change would take into account the fact that what represents 
improvement for one client in a particular direction on a particular variable may 
not represent improvement for another. Bergin (18) raises an interesting question 
on this point by asking whether a reevaluation of past outcome research would 
show that “meager results of many studies may have resulted from misapplica- 
tions of the same criteria to different patients” (p. 259). 

Clearly, the central issues in outcome research have moved beyond the 
effective-ineffective debate, and the assumption of a unitary process applied to a 
unitary problem has been firmly dispatched. Such a conceptualization lacks the 
potency and relevance to generate further progress in counseling and psycho- 
therapy research. Bergin & Strupp’s (21) position is indicative of the direction 
outcome research must now take: “We feel the problem of psychotherapy research 
in its more general terms should be reformulated as a standard scientific question: 
What specific therapeutic interventions produce specific changes in specific pa- 
tients under specific conditions?” (p. 8). 

Certainly another long-standing difficulty in outcome research has been the 
tendency to utilize counselors and therapists who are in training rather than 
experienced, professionally qualified persons. It is unlikely that any other profes- 
sion would attempt to examine the nature of its skills and the effects of its 
treatments by studying minimally or inadequately trained members. Bergin’s 
review (18) offers some evidence that “outcomes differ significantly as a function 
of experience level” (p. 237). He reports that “53 percent of the studies involving 
experienced therapists (20 of 38) yielded positive results, while 18 percent (2 of 11) 
by inexperienced therapists were positive” (p. 237). 
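
The proportions Bergin reports can be recomputed directly from the counts quoted above. The short Python sketch below does only that; the dictionary layout and names are assumptions made for illustration, and no significance test is implied.

```python
# Counts as quoted from Bergin (18): (positive studies, total studies).
outcome_counts = {
    "experienced": (20, 38),
    "inexperienced": (2, 11),
}

# Whole-number percentage of studies with positive results, per group.
positive_rates = {
    group: round(100 * positive / total)
    for group, (positive, total) in outcome_counts.items()
}

for group, rate in positive_rates.items():
    print(f"{group} therapists: {rate} percent of studies positive")
```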

Bergin’s data and the lack of logic behind this practice should be enough to 
dissuade researchers tempted by the all-too-available student-therapist popula- 
tion. A further argument against this practice can be found in the evidence for 
change in therapists’ attitudes and self-reported change in technique between 
internship and attainment of several years’ experience. 


THE RELATIONSHIP AS AN INTERMEDIATE OUTCOME The quest to determine what 
behaviors of the therapist are associated with positive outcome has led some 
researchers to look for commonalities among therapists of different schools and 
training backgrounds. The assumption is that one can abstract the essence of the 
therapeutic process from a discovery of the commonalities. These efforts are 
predicated upon a unitary view of psychotherapy in contrast to the multidimen- 
sional position taken by Bergin & Strupp (21) and Kiesler (91). 

Diversity of opinion among therapeutic systems is greatly in evidence in the 
theoretical literature in (a) the wide variety of conceptualizations utilized to 
explain a client’s behavior; (b) the difference in time period of the client’s life 
which is regarded important for therapeutic focus; (c) the nature of the therapeutic 
interventions; (d) the implicit and explicit tasks required of the client by the 
therapist; and (e) the forms of verbal interactions in the therapeutic process. The 
one area of agreement does appear to be in the centrality of the interpersonal 
relationship between the therapist and the client, particularly in the more tradi- 
tional psychotherapy literature. Bordin (24) affirms this in his observation: “Vir- 
tually all efforts to theorize about psychotherapy are intended to describe and 
explain what attributes of the interactions between the therapist and the patient 
will account for whatever behavior change results” (p. 235). 

The work of Truax & Carkhuff (135) and their associates is the most extensively 
developed and rigorously research-supported endeavor arising from the experiential, 
client-centered, relationship-oriented (as opposed to technique-oriented) school of 
therapy, which traces its theoretical paternity to Carl Rogers (120-122). They represent 
their efforts as having identified “certain prepotent therapist variables that cut across 
most theoretical models of therapy,” which they offer as constituting the basic 
conditions of a therapeutic relationship. 

The definition and delineation of these therapist conditions and their relation to 
outcome have been presented and summarized in a number of articles and books, 
the most comprehensive being those by Carkhuff and his associates (31, 37, 38, 
135, 137). Recent research in this area has been reviewed elsewhere in this chapter. 

Process research from a variety of areas of study has demonstrated the effects 
of therapist and relationship variables on client behavior. Bandura, Lipsher & 
Miller (12) found that therapist avoidance responses to client expressions of 
hostility inhibited further expressions of hostility or caused clients to change the 
object of hostility. Goldstein, Heller & Sechrest (62) demonstrated a relationship 
between the degree of facilitative conditions present in the therapy relationship 
and the degree of interpersonal attraction between therapist and client. Goldstein 
& Simonson (63) found interpersonal attraction to be significantly related to the 
type of client behavior engaged in during the interview, i.e. during a condition of 
positive attraction the client “(1) was less covertly resistive, (2) talks more, (3) is 
self-descriptively sicker, and (4) has more favorable prognostic expectancies for 
himself.” Correspondingly, the more attracted the therapist is to the client, “(1) the 
more the client talks, (2) the more open he is in the content of his communications, 
and (3) the less overtly and covertly resistive he is” (p. 164). 

Kratochvil et al (98) presented evidence that changes in the direction of counselor 
functioning had more effect on client functioning than did the counselor’s 
absolute level; i.e. low-functioning (on the “core conditions”) counselors changing 
in a positive direction had more positive effect on client’s functioning than did 
higher-functioning counselors who were deteriorating in functioning over a period 
of time. Carkhuff (31, 39) reported similar effects of counselor trainers on trainees. 
In interaction with a low-functioning trainer, the higher the trainee’s initial level of 
functioning, the greater the probability that he will deteriorate in functioning or 
terminate training. 

Vitalo (140) found that high-functioning (in terms of the “core conditions”) 
experimenters in a conditioning study were significantly more successful in pro- 
ducing high acquisition rates in subjects than were low-functioning experimenters. 
Truax (134) presented evidence that the therapist’s differential interpersonal 
reinforcement of client self-exploration led to an increase of self-exploration and 
greater therapeutic improvement in outcome measures. 

At the same time that the body of opinion among research leaders is moving 
toward a multidimensional model for outcome in which specific gains represent- 
ing the goals of the particular therapy and client sample are hypothesized and 
tested, there appear to be seeds of a consensual view of the therapeutic relation- 
ship as a criterion in the literature. Even the technique-oriented therapies are now 
according the relationship some significance, as evidenced in this statement by 
Goldstein & Wolpe (64): “In the case of interpersonal anxiety, behavior therapists 
might be giving instruction in assertive training, employing systematic desensiti- 
zation, advising, giving ‘homework,’ correcting misconceptions, all in the context 
of a warm and accepting atmosphere” (p. 222). 

Patterson (116, 118) has posited the relationship as an “intermediate criterion” 
in that it is the medium or vehicle through which change is mediated. There 
appears to be support for Patterson’s view in the conclusions of the more recent 
major reviews of the literature. Bergin & Strupp (21) list the psychological mech- 
anisms in most prominent use in the major systems of psychotherapy and note 
that “a crucial requisite for determining the extent to which the foregoing 
mechanisms can become operative is the patient’s amenability or receptivity to 
the therapist’s influence” (p. 13). 

Truax & Mitchell (137), reviewing the work of Truax, Carkhuff, and their 
associates, conceptualize the therapeutic effects of the core conditions drawing 
from a learning theory model. Their hypotheses support Patterson’s position and 
represent further convergence of conceptualization of the therapeutic process 
among theoretical systems. 

Bergin & Strupp (21), in their summary, appear to be arguing explicitly for the 
relationship as an intermediate criterion for outcome: 


Our critical look at the field suggests . . . that regardless of the therapeutic objectives and 
the technical procedures, the therapist must succeed in the following: 1. He must create 
conditions which make the patient amenable to his influence. Usually this is spoken of 
in terms of a therapeutic relationship in which the patient experiences respect, trust, and 
acceptance, which render him receptive to the therapist’s suggestions, interpretations, 
and the like. 2. While these conditions are being created, the therapist employs a variety 
of technical procedures (desensitization, interpretations, etc.) to influence the patient in 
directions considered therapeutically desirable and ultimately intended to increase the 
patient’s independence, self-direction, and autonomy (p. 17). 


Goldstein & Simonson (63) conceptualized the relationship as an intermediate 
criterion in their investigation of interpersonal attraction in psychotherapy. They 
describe their efforts as 


... developing techniques for “hooking” the patients, that is, for maximizing the 
favorableness of the initial relationship so that the patient, at a minimum, returns for 
further sessions and, more maximally, is open to the therapist’s influence attempts. We 
propose no direct relationship between initial patient attraction to his therapist and the 
subsequent therapy outcome. Instead, we view the initial relationship as a possible 
potentiator or catalyst whose consequents can lead to a more favorable outcome (pp. 
161-62). 


The relationship as an intermediate outcome criterion is a promising source for 
identifying the counselor behaviors related to effectiveness in that it allows us to 
look for a set of behaviors that are basic to influencing another person, irrespective 
of ultimate therapeutic goal. The selection task then becomes that of ascertaining 
whether there are individual differences in capacity to engage in the kinds of 
behaviors which are basic to the establishment of a therapeutic relationship. 

Previous studies which utilize either the relationship itself as an intermediate 
criterion (Goldstein & Simonson 63) or the relationship behaviors which them- 
selves are regarded as therapeutic (which would include most of the studies 
employing the Truax scales or Carkhuff’s revision) have in common a criterion of 
effectiveness that reflects a response style of the counselor toward the client’s 
expressions of feelings. This response style is basically one of receptivity or ap- 
proach of the client’s affective behavior. 

The assumptions concerning the process of effectiveness differ. For some re- 
searchers the approach response style serves variously in different stages of ther- 
apy as a positive reinforcer (Bandura et al 12) and desensitizer (Truax & Mitchell 
137). For others, implicitly therapeutic interpersonal nurturance is the effect of 
approach behaviors constituting the criterion (empathy, self-disclosure, confron- 
tation) (Carkhuff 30). What is central, however, is the use of a criterion reflecting 
an approach response to client’s verbalizations of feelings. 


THE STUDENT DEVELOPMENT LITERATURE 


Preparing a synopsis of the literature on student development presents many 
problems due to the tremendous variety of studies on this topic. Not only are there 
vast numbers of individual studies, but these studies themselves differ in philo- 
sophical approach, methodology and content. 


Methodological Issues 


COMMON DESIGNS Student development research is not developmental research 
in general. Wohlwill (148) has listed the tasks of developmental psychology 
as: (a) determining the structural relationships and temporal patternings of 
change among sets of variables, and (b) specifying functional relationships 
between particular situational, experiential, or organismic variables and selected 
parameters of change. The task of student development research is to study the 
relationship between situational and experiential variables and change in behav- 
ior, attitudes, and values during the college years while ignoring the change 
caused by organismic variables. 


THE LONGITUDINAL DESIGN Just as in developmental psychology, methodologi- 
cal problems beset researchers who study changes in students over time. Astin (5) 
criticizes much of the research conducted in this area and proposes multidimen- 
sional and longitudinal approaches. A longitudinal (or time series) design requires 
several measurements of student attitudes, usually over a period of several years. 
The process can be repeated as many times and for as long a period as is necessary. 
The advantage of this design is that subjects are members of the same cohort; that 
is, that they are all freshmen or sophomores at the same time, and accordingly 
have been exposed to similar experiences (e.g. the philosophy of the college was 
the same for all). In addition, a baseline for early behavior of the subject is 
established and can be subtracted from later measurements. 

However, there are disadvantages to employing this design. It is time- 
consuming, and it does not control for the effect of history (Campbell & Stanley 
29). History, or the effect of events extraneous to the college experience, may ac- 
count for differences between the first and second measures. For example, the 
gasoline crisis of winter 1974 may result in great changes in the behavior of last 
year’s freshmen, although such changes would be unrelated to these students’ 
college experience. There are also sampling problems inherent in the longitudinal 
design, since those who leave college are usually eliminated from the sample. This 
in turn limits the generalizability and interpretation of the results. 


THE CROSS-SECTIONAL DESIGN If freshmen and sophomores were both subjects at 
the same time and compared on some variable, results would be very different. 
This cross-sectional type of design confounds a number of variables, such as 
the year of the subject and the cohort. In terms of the college population it is also 
subject to sampling problems: the upperclassmen are not usually comparable to 
the freshmen samples because the designs do not take into account dropouts, 
transfers, and other shifts in the college population. This occurrence, called 
experimental mortality (Campbell & Stanley 29), could produce effects which are 
confounded with the effect of the college experience. Unfortunately, this is a 
common design in student development research. 


MORE SOPHISTICATED ALTERNATIVES A noted developmental researcher, Schaie 
(127), suggests several alternatives to the longitudinal and cross-sectional designs 
which might be more effective in student development research. According to 
Schaie, responses are a function of the subject’s age (or year in school, in this case), 
the time the measurement is taken, and the cohort of the subject. The designs 
described earlier usually only take into account one of these variables at a time. 
The designs Schaie suggests, while not considering all three variables, are an 
improvement in that two variables are considered simultaneously. 


THE TIME-SEQUENTIAL DESIGN The first design, the time-sequential, considers 
both the year of the subject and time of measurement. In a time-sequential design, 
for example, four groups of subjects might be used: (a) a group of subjects who 
were freshmen in 1974 and measured in 1976 when they were juniors; (b) a group 
of subjects who were freshmen in 1975 and measured in 1976 when they were 
sophomores; (c) a group of subjects who were freshmen in 1975 and measured in 
1977 when they were juniors; and (d) a group who were freshmen in 1976 and 
measured in 1977 when they were sophomores. Year is taken into account, since 
there are two groups at each year level, as is time of measurement, since half of the 
groups are measured in 1976 and the other half in 1977. Cohort is not considered in 
this design. 


THE COHORT-SEQUENTIAL DESIGN Another new design is called the cohort- 
sequential. This design takes into account cohort and year but not time of mea- 
surement. Here again two groups from different years would be recorded as 
subjects. These would be (a) one group of subjects who were freshmen in 1974 and 
measured in 1976 when they were juniors; (b) one group of subjects who were 
freshmen in 1974 and measured in 1977 when they were seniors; (c) one group 
who were freshmen in 1975 and measured in 1977 when they were juniors; and (d) 
a group of freshmen in 1975 measured in 1978 when they were seniors. 


THE CROSS-SEQUENTIAL DESIGN The final design of this type is called a cross- 
sequential design. It takes into account the time of measurement and the cohort 
but not the year of the subject. For example, sophomores and juniors would be 
measured in 1977 (having entered in 1976 and 1975 respectively). Juniors and 
seniors would be measured in 1978 (having entered in 1976 and 1975 also). The 
subjects would not be of comparable years. 

So, although these designs are an improvement over the simple longitudinal 
design, each fails to consider one of the three variables. Selection of a particular 
design depends on which of the three variables (time of measurement, year, and 
cohort) the researcher is willing to leave confounded. 
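
The three sequential designs can be laid out side by side with a small sketch. It assumes a simple coding that is not in the original text: year level is counted as years elapsed since entering as a freshman, so that time of measurement equals cohort (entry year) plus year level. Under that assumption each design crosses two of the three variables and the third is fully determined, which is why it cannot be separated out. The function and variable names are hypothetical.

```python
# Year level coded as years elapsed since entering as a freshman (assumed coding).
LEVELS = {"freshman": 0, "sophomore": 1, "junior": 2, "senior": 3}
LEVEL_NAMES = {offset: name for name, offset in LEVELS.items()}

def cell(cohort=None, level=None, measured=None):
    """Complete a design cell from any two of cohort, level, and measurement year.

    The three are tied together by: measured = cohort + LEVELS[level].
    """
    if measured is None:
        measured = cohort + LEVELS[level]
    elif cohort is None:
        cohort = measured - LEVELS[level]
    else:
        level = LEVEL_NAMES[measured - cohort]
    return (cohort, level, measured)

# Time-sequential: year level crossed with time of measurement.
time_sequential = [cell(level=lvl, measured=t)
                   for t in (1976, 1977) for lvl in ("junior", "sophomore")]

# Cohort-sequential: cohort crossed with year level.
cohort_sequential = [cell(cohort=c, level=lvl)
                     for c in (1974, 1975) for lvl in ("junior", "senior")]

# Cross-sequential: cohort crossed with time of measurement.
cross_sequential = [cell(cohort=c, measured=t)
                    for t in (1977, 1978) for c in (1976, 1975)]

designs = {
    "time-sequential": time_sequential,
    "cohort-sequential": cohort_sequential,
    "cross-sequential": cross_sequential,
}
for name, groups in designs.items():
    print(name)
    for cohort, level, measured in groups:
        print(f"  freshmen in {cohort}, measured in {measured} as {level}s")
```

The printed groups reproduce the examples given for each design above; changing the years or levels in the list comprehensions generates other layouts under the same coding.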


ADDITIONAL VARIABLES Two additional variables which are not applicable to 
developmental research in general but are most important to student development 
research are intercollege comparisons and comparisons of students and non- 
students. The first variable suggests that by comparing the effect of various col- 
leges on students it is possible to learn about the college experience. The second is 
predicated on comparing the effects of college attendance to nonattendance. 
Unfortunately, most of the research in this area examines only college students 
and completely ignores those young adults who have not gone on to college. When 
these variables are considered, however, researchers should be careful that those 
variables mentioned earlier are also taken into account. 
Statistical Issues 


Aside from these methodological techniques for improving student developmen- 
tal studies, there have been numerous statistical techniques suggested by re- 
searchers in the field. Astin (4) suggests a three-step model examining students at 
different universities. This model considers the contributions of three types of 
variables: (a) input—those “talents, skills, aspirations, and other potentials for 
growth and learning that the student brings with him”; (b) the college environ- 
ment—“those aspects of the higher educational institution that are capable of 
effecting the development of the student”; and (c) output—“those skills, attitudes, 
and behaviors of students the higher educational institution either does influence 
or attempts to influence.” The advantages of this model, according to Astin, are 
several. In the typical study, an instrument is given to students over a period of 
time (longitudinal study). The difference between various administrations of the 
instrument is said to reflect the changes caused by the college experience. The 
problem with this assumption, as mentioned earlier and noted by Astin (8), is that 
“. . . any obtained relationship between educational practice and student output is 
necessarily ambiguous so long as no control is exercised over differential student 
input” (p. 301). His design eliminates the problem by taking into account student 
input. With his three-step model, Astin partials the effect of the input out of the 
output and also out of the college environment. 
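
The idea of partialling input out of both output and environment can be illustrated with a minimal sketch. It is not Astin's actual procedure or data: it assumes a single input measure, a single environmental measure, and a single output measure, all with made-up values, and uses simple least-squares residuals. Every name and number below is hypothetical.

```python
from math import sqrt

def mean(values):
    return sum(values) / len(values)

def pearson(a, b):
    """Pearson correlation between two equal-length lists."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)

def residuals(y, x):
    """Residuals of y after a simple least-squares regression of y on x."""
    mx, my = mean(x), mean(y)
    slope = (sum((a - mx) * (b - my) for a, b in zip(x, y))
             / sum((a - mx) ** 2 for a in x))
    intercept = my - slope * mx
    return [b - (intercept + slope * a) for a, b in zip(x, y)]

# Hypothetical scores for eight students (illustrative values only).
student_input = [45, 50, 55, 60, 65, 70, 75, 80]   # e.g. entering ability
environment = [2, 3, 3, 5, 4, 6, 7, 6]              # e.g. a college characteristic
output = [48, 55, 57, 66, 66, 75, 82, 83]           # e.g. a later criterion score

# Remove the part of output and of environment that input predicts.
output_resid = residuals(output, student_input)
env_resid = residuals(environment, student_input)

raw_r = pearson(environment, output)         # confounded with input
partial_r = pearson(env_resid, output_resid)  # input partialled out of both

print(f"raw environment-output correlation:   {raw_r:.3f}")
print(f"with input partialled out of both:    {partial_r:.3f}")
```

When brighter students tend to enroll in particular environments, the raw correlation credits the environment with differences that were already present at entry; the residual correlation is one way of removing that overlap.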

Astin’s research model represents an improvement over many used in past 
studies in that it incorporates (a) students from a representative sample of colleges 
and universities instead of biased samples; (b) uniform measurement instruments, 
sampling techniques, and methods of subject identification to make data from 
different investigations interchangeable; and (c) both longitudinal and cross-sectional 
measurements of subjects to result in an approximation of the 
time-sequential design described earlier. Astin’s research utilizing this model was 
initiated in 1965 with a collection of student input information. The second stage 
of his model consisted of follow-up criterion data and data about environmental 
variables. This project has been ongoing ever since and has provided an enormous 
amount of information about college students. 

Astin (4) criticizes suggested statistical alternatives to his techniques. Matching 
students at different colleges is undesirable because the subsamples thus attained 
are unrepresentative of their institutions. There are also statistical problems inherent 
in using actuarial tables to control differential input. 

Astin (4), aware of the shortcomings of his own design, concludes that no matter 
how good the design is, the investigator can be certain that he has isolated the true 
college impact only in those fortuitous situations where environmental variables 
are uncorrelated with the input variables in his multiple regression process or 
where the correlation between the environmental variables and the output vari- 
ables is much higher than that between environmental variables and the input 
variables. 

Feldman (57) suggests several alternatives to the Astin model. These include: 
path analysis (Feldman 57); Werts’ (145) method in which he partitions the 
explained variance in the outcome variable into components in order to examine 
both the independent effects of the predicted variables and the joint effects; and 
Creager’s (46) method of partitioning variance of the output variables into the 
unique contribution of subsets of variables and the commonality of all subsets. 
Which method is to be used, Feldman concludes, depends largely on the focus of 
the research—whether it is correlational prediction or causal analysis. 
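
Partitioning explained variance into unique and common components can be sketched for the simplest case of two predictors. This is the generic textbook decomposition, offered only as an illustration; it should not be read as the exact procedure of Werts (145) or Creager (46). The correlations used are hypothetical.

```python
def commonality_two_predictors(r_y1, r_y2, r_12):
    """Split R^2 for two predictors into unique and common parts.

    r_y1, r_y2: correlations of each predictor with the outcome.
    r_12: correlation between the two predictors.
    """
    # Squared multiple correlation for two predictors.
    r2_full = (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)
    unique_1 = r2_full - r_y2 ** 2              # gain from adding predictor 1 last
    unique_2 = r2_full - r_y1 ** 2              # gain from adding predictor 2 last
    common = r_y1 ** 2 + r_y2 ** 2 - r2_full    # variance the predictors share
    return r2_full, unique_1, unique_2, common

# Hypothetical correlations: predictor 1 = student input, predictor 2 = environment.
r2_full, unique_input, unique_env, common = commonality_two_predictors(
    r_y1=0.6, r_y2=0.5, r_12=0.4
)
print(f"R^2 = {r2_full:.3f}")
print(f"unique to input = {unique_input:.3f}")
print(f"unique to environment = {unique_env:.3f}")
print(f"common to both = {common:.3f}")
```

The three components sum to the total explained variance. The common component cannot be attributed to either predictor alone, which is the ambiguity that arises whenever input and environment are correlated.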


Philosophical Issues 


Moving out of the methodological and statistical arena, research in the area of 
student development also varies in terms of philosophical and theoretical stance. 
Most often studies seem to lack systematic theoretical development. Investigators 
appear to be more concerned with a particular subgrouping of the university 
population and how it reacts to a particular program at a particular university, 
than with advancing the body of knowledge of student development research on the 
impact of the college experience on the student. 

Some studies do consider the issue of impact, but few of those investigations are 
founded in theory. A limited number of studies do construct a theoretical 
framework on which to view student development research. This framework 
usually takes one of four major approaches: (a) the goals of higher education; (b) 
personality development of the student; (c) preparing students to occupy certain 
occupational and social roles; and (d) variation among colleges (Feldman 56). 


GOALS OF HIGHER EDUCATION Studies of the first type usually center around the 
academic achievement of students. These include Astin’s (8) study of the impact of 
dormitory living on students. Although this particular study goes beyond simply 
looking at the educational progress of the students into an examination of atti- 
tudinal and personality variables, it does provide a useful model. Results of the 
study indicate that at a four-year college, living away from home has positive 
effects on the student’s educational development. Students who live away from 
home are likely to be more successful than students who live with their parents on 
a number of dimensions: (a) they are less likely to drop out; (b) they are more 
likely to receive a high grade point average; (c) they are more likely to complete a 
baccalaureate in 4 years; and (d) they are more likely to apply to graduate school 
than are students who live at home. Dormitory living has somewhat of an advan- 
tage over living in a private room. This study attempts to look at the impact of 
college residential patterns on educational attainment. In another study, Hall (67) 
compares the academic performance of community college students with that of 
4-year college students and finds that the community college students fare better. 
After 4 years, however, the students who transferred to 4-year colleges do no better 
than do 4-year students. Unfortunately, to conclude that community colleges have 
a superior program to 4-year colleges would be erroneous because of methodological 
inadequacies of Hall’s design. 

Teaching techniques are often investigated under this format. Leatherwood (99) 
investigated the effects of different teaching procedures on junior college students. 
Students found the lecture method to be the least effective when compared with 
leaderless discussion and role playing. Van Dyne (138) investigated the impact of 
experimental college on the students. He found that most learning in this situation 
takes place in small seminars, independent study, and off-campus field study. For 
some students the freedom was hard to handle, and for others this type of educa- 
tion resulted in greater self-discipline and better awareness of goals, limitations, 
and strengths. The investigator points to difficulties inherent in evaluating student 
growth in these programs. 


PERSONALITY DEVELOPMENT A second approach to student development re- 
search based upon a theoretical framework is to conduct studies on the personality 
development of students (Chickering 41). Chickering identified seven significant 
areas of development, showed the progression of development in these areas using 
students’ evaluation of themselves, and proceeded to discuss the types of programs 
which would foster change in these areas. Underlying Chickering’s research is the 
Piagetian concept that development occurs through a sequence of differentiation 
and integration. From this theoretical base, Chickering has then suggested ways of 
setting up college programs to foster development. More importantly, Chickering 
has isolated variables of student development: competence, management of 
emotion, autonomy, identity development, interpersonal development, integrity 
development, and goal development. 

One recently completed study (Barton, Cattell & Vaughan 13) does come closer 
than most to pinpointing change due to the college experience. It is also one of the 
rare studies which compares college students to those who do not attend college. 
Barton et al were particularly interested in specifying changes in personality which 
occur as a function of college attendance. 

Subjects in this study completed Cattell’s 16 personality factor questionnaire in 
1965 and in 1970 filled out another questionnaire designed to assess the major 
environmental events which occurred in each subject’s life between 1965 and 1970. 
On the basis of the second questionnaire, subjects were divided into a college 
group and a work group. 

Results indicate that college students score higher on the Cattell questionnaire 
both before and after college. For both groups, intelligence and radicalism factors 
increased and suspicion decreased. The authors conclude that these changes 
indicate age trends (naturally occurring development). 

For the factors of tendermindedness and imagination, the college group in- 
creased at a higher rate than did the work group, while the work group showed a 
significantly higher increase in self-sentiment than did the college group. The work 
group also showed a significant decrease in independence. 

King (93) was project director for an important study in the field of college 
student developmental research. Known as the Harvard Student Study, this was a 
longitudinal study of personality development during late adolescence and early 
adulthood. There were four objectives: 1. investigation of the process of personal- 
ity change and stability as this was related to personality structure and interaction 
with the college; 2. study of the socializing and allocating functions of the college 
as an organization in society; 3. development of research methods and general 
research strategies; 4. development of theory. This latter point is particularly 
important. The study was unable to proceed on the basis of hypothesis testing 
based on well-established theory, according to King, because such theory did not 
exist. 

The complexity of this interdisciplinary research (the research team consisted of 
representatives from sociology, statistics, clinical psychology, psychiatry, and 
psychoanalysis) is reflected in the translation of general aims into an operational 
framework consisting of six major variables. Independent variables were the 
organizational characteristics of Harvard College and the social, cultural, and 
personality characteristics of the staff and students. Role theory was utilized as a 
mediating mechanism between the student’s personality and the college environ- 
ment. Finally, criterion variables were patterns of student personality change and 
stability, stabilization and adaptation, and postcollege roles. A panel design was 
utilized to ascertain personality changes over time, including both survey and 
intensive case study approaches. An organizational analysis was undertaken 
which assessed the structures and functions of the college, with particular 
emphasis on the academic departments, residential units, and extracurricular 
organizations. It is well beyond the scope of this review to consider all of the 
contributions to basic student developmental theory which will be generated 
by this seminal work. 

King (93) published the first in a series of book reports on this project. The 
“continuity model” of evolutionary personality change which he postulated 
should provide a basis for hypothesis testing from established theory for subse- 
quent researchers. 

Research in other college environments which will generalize beyond the 
confines of the predominantly eastern selective college subculture should be a first 
task in future research. 


OCCUPATIONAL DEVELOPMENT The third area, the employment area, includes 
McCrea’s (101) study of the occupational outlook of students. Basically a survey of 
student expectations of educational and occupational attainment, this type of 
study provides little information about the impact of college. 


VARIATIONS AMONG COLLEGES Finally, studies of the college environment are 
more logically related to a conceptual framework. It is not always possible to 
interpret the correlations between variations in demographic features of a college 
and variations in student change. What is important is studying how the demo- 
graphic characteristics create conditions which in turn have an impact on the 
college student. For this reason, environments are studied. Baird (11) distinguishes 
eight different functions of college environmental measures which are related to 
three philosophies of measurement: (a) general research versus applied; (b) inter- 
versus intra-university; and (c) institution versus individual. Some of the common 
types include general intrauniversity studies of institutions such as the College and 
University Environment Scale (CUES) by Pace (114), and the College Characteristics 
Index (CCI) by Stern (130), indicating features and characteristics of 
college environments, and the general interuniversity studies of individuals (ACE 
type) of Astin et al (8), in which norms of freshmen student characteristics across 
colleges are developed. Baird cautions that no research can possibly serve all eight 
functions. 

Because students’ responses to these standard college environment measures are 
in part determined by their own demographic characteristics (sex, year in school, 
major field, residence), some researchers have found it more expedient to partition 
the total population into several subsamples which can be compared, or to study 
one subsample of the university population and the impact the educational pro- 
cess has had on that group. As long as no attempt is made to generalize to a larger 
college population, some useful information can be gathered from these data. 


Breakdown of the Student Population 


DROPOUTS As was mentioned earlier, most research in student development 
looks only at students in college without comparing the data to those gathered 
from students who have never attended college. There has been great interest, 
however, in studying students who were once in college but who no longer are at 
the time of measurement. Studies of dropouts are done, not in order to partition 
out college impact from general development, but to find out where colleges have 
failed. Cope & Hewitt (44) studied 1131 students who dropped out within 2 years 
of college enrollment. Since these investigators were primarily interested in stu- 
dents who chose to drop out, those leaving for involuntary reasons (e.g. illness) 
were excluded from the analysis. Their results indicate that the most frequent 
reasons given for leaving college were social, followed by academic, family, 
religious, financial, fraternity and sorority, and discipline. Morrisey (110) studied 
dropouts but only among freshmen. With this group, scholastic reasons were most 
often responsible for the attrition. Persisters were compared to dropouts and it was 
found that persistence was positively related only to high school rank and fall 
grade-point average in college. Zaccaria & Creaser (150), who compared grad- 
uates, achieving withdrawals, nonachieving withdrawals, and failures, found 
differences on high school rank and ACT between the former two as compared to 
the latter two. However, the Edwards Personal Preference Schedule shows that 
persisting students have somewhat different personal needs than those of students 
who withdraw. Taken together, these results suggest that, after an initial period of 
adjustment to the academic rigors of college, social pressures become more im- 
portant in decisions of whether to stay in school or leave. Astin’s (6) national 
profile of dropouts seems to contradict this finding. His data indicate that the 
principal predictors of persistence were high school grades and tests of academic 
ability. However, his definition of persistence is more conservative than that of 
most other researchers—persisters are those who complete a BA in 4 years. The 
criterion therefore excludes all students who take reduced course loads, who “stop 
out”, or who need additional time to make up academic failures. It is not surpris- 
ing therefore to find that academic data provide the best predictors of those 
students who are indeed the top students. 
Residents versus Commuters 


Comparing on-campus residents with nonresidents is another way in which stu- 
dent development researchers have partitioned the overall student population. 
Sauber (126) investigated the effect of place of residence on college adjustment at 
Florida State University. Place of residence did not affect the frequency and 
severity of problems studied. Those living at home or in university residence halls 
had slightly fewer problems with sociopsychological and personal psychological 
relations. Paolone (115) investigated the relationship between place of residence 
and maturity and found little relationship. He did find that those students living 
off campus were less dogmatic and authoritarian than those living on campus. 
Chickering & Kuper (42) compared educational outcomes for commuter and 
resident students. Results indicated that commuters and residents have similar 
academic experiences but that there are large differences between the two groups 
in terms of extracurricular activities and relationships with peers and parents. 
Commuters participate less in extracurricular activities and hold fewer leadership 
positions. They also have closer ties with their parents and fewer campus friends 
and fewer dating relationships. Over a 4 year period, commuters changed more 
often than residents in the direction of increasing their intellectual interests while 
residents changed more in nonintellectual measures such as complexity and au- 
tonomy. Finally, Astin’s (7) study, mentioned earlier, found that living away from 
home has positive benefits on the student’s education: these students were less 
likely to drop out and were more likely to apply to graduate school and receive 
high grade point averages. Living away from home also had consistently positive 
effects on students’ perceptions of their own interpersonal competency. Living 
away from home increased the probability that students would be satisfied with
the overall undergraduate experience. Astin concludes that institutions considering
abandoning dormitories may want to take a second look. 

It is difficult to draw conclusions about the impact that place of residence has on 
student development in view of these contradictory findings. Since, as discussed 
earlier, most of the methodologies employed by the above investigators are im- 
perfect, perhaps unequivocal results might yet be found. 


THE DISADVANTAGED STUDENT Aside from partitioning the student population in 
informative ways, student development researchers have also chosen frequently to 
concentrate on a particular subgroup of the total population. 

In the last few years, the disadvantaged student has received close scrutiny. A 
comprehensive study done by Astin et al (9) assesses the effectiveness of special 
programs designed to serve high risk students. The investigators were particularly 
interested in isolating those program components which promised to be of greatest 
benefit in educating the disadvantaged student. Although the study suffers the 
common drawback of not distinguishing between naturally occurring develop- 
ment and that development which is specifically related to college experience, it 
does isolate certain environmental variables which influence developmental out- 
come. Using the three-step model of student input characteristic, environmental 
characteristic, and student output, the study determined: 1. Disadvantaged students
are more likely to persist in a larger, more selective university than at a less 
selective institution. They are also more likely to stay at a 2-year college, but the 
important factors appear to be structure, faculty and curricula. 2. Black students 
and students of low socioeconomic background receive lower grades in college 
than those predicted on the basis of high school grades; however, if those same 
students went to selective colleges they would have received better grades than 
predicted. 3. Predominantly Black colleges had the effect of strengthening student 
interest in becoming an authority in a special field or a community leader. 4. The 
source of financial support is an important factor as to college satisfaction and 
persistence; scholarships are particularly effective in this regard, while loans had 
no effect. 

Some of the studies in this area focus on particular types of special programs, 
such as the Open Admission Program at the City University of New York, through 
which all New York City high school graduates are promised admission to one of 
the city’s 2-year or 4-year colleges. The program was under attack from many 
quarters following its inception, with the accusations being that it would lower the 
standards of the colleges and that Open Admission students were failing in larger 
numbers than regular admission students. Benjamin & Powell (15) counter these 
charges with data indicating that at least at one college the program was not having 
the expected deleterious results. While students admitted under the Open Ad- 
mission Program earned fewer credits during their first semester, 35% successfully 
completed the semester. The results are viewed as positive by the authors of the 
study. 

Other studies on disadvantaged students include Sedlacek & Brooks’ (129) 
attempt to predict academic success of students in special programs. Because of 
their largely Black sample, results are mostly applicable to Blacks. They found that 
SAT scores and high school grades did not correlate with college grades, and that 
better predictors were the number of credit hours attempted and a positive reac- 
tion to external control. They conclude that there are differences in the variables 
relating to Black student success and that these should be considered in the 
admission process. 


MINORITY STUDENTS The impact of a college experience on minority students as 
a subgroup of the total student population has also been examined by several 
researchers. Bayer (14) reports on trend characteristics of Black freshmen as 
compared to non-Blacks. His results indicate few changes in trends among Black 
college freshmen since 1968 and a remarkable similarity between Black and 
non-Black students in predominantly White universities. Watley (142) investigat- 
ed the effects of marriage on college attendance among both Black and non-Black 
students. In general, married students are less likely to complete their freshman 
year than are nonmarried students. They are also likely to get poorer grades than 
their single peers. His findings also indicate that a greater portion of Black men are 
married than non-Black men. Watley (141) also looked at the effect of finances on 
college attendance among Black and non-Black students who had taken the 
National Merit Scholarship Qualifying Test. Blacks had more scholarships, government
aid, and college loans, while non-Blacks received more parental aid, 
summer jobs, and savings. Students who did not attend college indicated that they 
needed money for at least one half of their college expenses. 

These studies delineate characteristics of students in this subgroup, but give us 
very little information about the impact college has had on this subgroup. Some 
research suggests ways in which the university may have some impact on the lives 
of its students. Burback & Thompson (27) studied alienation among Puerto Rican, 
Black, and White college students. They found that Blacks scored higher than the 
other subgroups on total alienation, higher than Whites on powerlessness and 
normlessness, and higher than Puerto Ricans on social isolation. The authors 
conclude that Black students face more contradictory norm patterns and feel less 
in control of their own lives than do other students. They also suggest that the 
university must work to reduce their negative feelings and change their attitude in 
order to help them adjust to the college environment. 


OTHER SUBGROUPS Other subgroups that have been studied by student development
researchers include low-income students, “older” freshmen, “low
achievers,” and veterans. Holmstrom (77) looks at low-income students as compared
to the other undergraduates. Using the Astin technique, she found that they 
are more likely to be found in public institutions, that they are more often Black 
and older, that their high school grades were generally higher than those of other 
students, but that in college they have slightly lower grade-point averages and 
fewer of them receive baccalaureates. Among low-income students, the sex of the 
student appears to be an important variable, in that more women in this category 
are likely to receive their degrees than are men. As in earlier findings, those 
low-income students at highly selective institutions are more likely to obtain the 
BA in 4 years than are those in less selective schools. For both low-income students 
and others, financial support is an important factor for academic success, but the 
low-income student is particularly dependent on support if he is to make satis- 
factory progress. 

Holmstrom (78) also studied “older” freshmen. She found that most of these 
students tended to enroll in public institutions in the liberal arts fields rather than 
technical fields. Usually they enrolled in small, less selective schools. Most were 
men and economically disadvantaged. Their high school grades were lower than 
those of the “typical” student, as were their college grades. In a third study 
completed by Holmstrom (79), low achievers were compared to “other” under- 
graduates. Low achievers were mostly men and tended to be older. In general, low 
achievers differed from other students in college experience, plans, achievements, 
and attitudes. 

Drew & Creager (52) studied the veterans who returned to college. Results 
indicate that veterans are older than the average freshman and from a lower 
socioeconomic status. More veterans were Black than White, most had poor high 
school academic records, and had lower educational aspirations than the average 
freshman. 
Most of these studies are in the category of descriptive studies. The different 
variables that are studied are unrelated and thus lack meaning. Again, the impact 
of college is not directly the focus of the study, nor is it useful to speculate 
regarding the impact from the data presented. 


Student Development Research: Where It Stands 


While the general state of the literature is less than satisfactory, there are several 
positive aspects of student development research that need to be emphasized. 
Astin and his colleagues deserve credit for generating a great wealth of informa- 
tion on the college student. Their three-step model has been the basis for many of 
the studies completed in the last few years. Within this review, several studies 
employ this research model (5, 7-9, 14, 52, 77-79). Certainly the heuristic value of 
this model is great. 

The continued attempt by others in the field to modify the research methodol- 
ogy is encouraging. It is also encouraging that researchers such as King (93) 
are seeing the importance of studies of student development in a theoretical 
framework. This approach will hopefully lead to an elimination of the piecemeal 
approach that has occurred in the past. 
A typical learning task involves a motivated organism that perceives a stimulus,
reacts to it, and subsequently modifies its behavior. A complete analysis 
of the neurophysiology of learning thus involves aspects of sensory function and 
control of movement as well as aspects of motivation and drive. Indeed, the 
analysis of learning encompasses almost all areas of interest to psychologists, and 
the broad scope of the problem is reflected in the bewildering number of ap- 
proaches and techniques that have been applied to its study. In the present review, 
only a very limited selection of the total literature will be considered. An impor- 
tant basis for selection has been the degree to which it is possible from the research 
to draw conclusions regarding cellular mechanisms of neuronal plasticity. The 
review is divided into two main parts. The first part deals with research that 
attempts to study the cellular mechanisms of actual examples of behavioral mod- 
ifications, either habituation or associative forms of learning. The second part 
deals with model studies of neuronal plasticity, in which the behavioral relevance 
of the plasticity has not yet been directly ascertained. 

As a result of the emphasis on plastic mechanisms, the review primarily covers 
electrophysiological studies on invertebrates or simplified vertebrate preparations. 
By and large, with present day techniques, it is only in such preparations that it is 
possible to do the types of experiments that definitively establish mechanisms. 
Largely excluded from the present review are studies utilizing such techniques as 
spreading depression, inhibition of protein synthesis, and lesions. With few ex- 
ceptions, the interpretation of results of experiments utilizing these techniques 
depends upon a number of assumptions whose validity cannot be determined. 
Because of space limitations, the present review also excludes the numerous 
biochemical studies of learning that have appeared in the past few years. 

This review will emphasize the most recent findings, but in order to maintain 
some degree of historical perspective, studies within the past 5 years will also be 
considered. For this reason, the review will slightly overlap the previous chapter in 
the Annual Review of Psychology on the neurophysiology of learning (119). 
As far back as the turn of the century, there was speculation that learning 
involves alterations of synaptic interactions between neurons. This hypothesis has 
been supported by numerous experiments demonstrating that synapses have a 
great capacity for plasticity, but only in the last 5 years has there been convincing 
evidence that synaptic plasticity mediates alterations of actual behavior. Not 
surprisingly, the first demonstrations of the behavioral relevance of synaptic 
plasticity involved studies of habituation, one of the simplest forms of behavioral 
modification. 


HABITUATION 


It is gratifying to see that there has been relatively little debate about whether or 
not habituation qualifies as “true” learning. There seems to be general agreement 
that habituation is an important class of behavior modification in the life of all 
organisms, and that habituation has many of the features of more complex forms 
of learning (e.g. 69, 98, 99). 


Vertebrate Studies 


The studies of Spencer, Thompson & Neilson (115, 115a) on decrement of the 
flexion reflex in spinal cats were the first to provide convincing data on the possible 
mechanisms of habituation. The flexion refiex has also been used to show that 
dishabituation is a superimposed facilitatory process not directly linked to habit- 
uation (115, 118). On the basis of studies of habituation and dishabituation, it has 
been postulated that every stimulus simultaneously evokes two independent pro- 
cesses—one leading to decrement, the other to increment of responsiveness (118). 

This simple idea provides a powerful explanatory principle for a large number of 
experimental observations. 
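
The two-process idea can be illustrated with a small numerical sketch. The code below is not taken from the studies cited; it is a toy model in which the function name, the multiplicative combination of the two processes, and every parameter value (hab_rate, sens_gain, sens_decay) are assumptions chosen only for illustration. Each stimulus presentation drives a decremental process and, independently, an incremental process whose strength depends on stimulus intensity.

```python
def simulate_dual_process(n_trials, intensity,
                          hab_rate=0.15, sens_gain=0.3, sens_decay=0.5):
    """Return one response amplitude per stimulus presentation.

    All names and parameter values are hypothetical, for illustration only.
    """
    habituation = 0.0    # decremental process; approaches 1 with repetition
    sensitization = 0.0  # incremental process; driven by stimulus intensity
    responses = []
    for _ in range(n_trials):
        # Response on this trial reflects the current state of both processes.
        response = intensity * (1.0 - habituation) * (1.0 + sensitization)
        responses.append(response)
        # Each stimulus updates the two processes independently of each other.
        habituation += hab_rate * (1.0 - habituation)
        sensitization = sens_decay * sensitization + sens_gain * intensity
    return responses


if __name__ == "__main__":
    weak = simulate_dual_process(20, intensity=0.2)
    strong = simulate_dual_process(20, intensity=1.0)
    print("weak:  ", [round(r, 3) for r in weak[:5]])
    print("strong:", [round(r, 3) for r in strong[:5]])
```

Under these assumed parameters a weak stimulus yields a purely decremental series, whereas a strong stimulus yields an initial increment followed by decrement, which is the qualitative pattern the two-process account is meant to explain.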

On the basis of the research of habituation of the flexion reflex, seven possible 
mechanisms have been excluded. For example, habituation of the flexion reflex is 
not due to plastic changes affecting the excitability of the motor neurons (115a, 
127). It is also unlikely that habituation is due to a type of presynaptic inhibition at 
the sensory fiber terminal (49, 127) and, in fact, indirect data suggest that it is 
unlikely that any type of active inhibitory process underlies habituation of the 
flexion reflex. One report (128), however, claims to have obtained evidence that on 
theoretical grounds suggests the operation of an active inhibitory process. These 
findings, however, could not be replicated in a later study by another laboratory 
(45), and the theoretical grounds for the argument have been questioned (55). The 
bulk of evidence suggests that habituation of the flexion withdrawal reflex in 
spinal cats is due to decrement of excitatory synaptic potentials at the level of the 
interneurons in the circuit (115a, 128). Some indirect data argue against the 
possibility that significant decrement occurs at the first synapse (between sensory 
afferents and interneurons) in the flexion reflex of the cat. For example, decrement 
at the primary afferents would result in there being little or no generalization 
between receptive fields, and yet there is virtually complete generalization of 
habituation from one receptive field to another (45, 115). Furthermore, those 
interneurons that appear to receive monosynaptic connections from cutaneous 
afferents do not show a decrease of evoked spike activity when repeatedly activated
(115a, 128). It should be noted, however, that in a study of generalization of 
habituation of the flexion reflex in humans with spinal transection, when a single 
test stimulus presentation was used to assess generalization, there was virtually no 
generalization, even when the test point on the skin was as close as 4 cm away from 
the point that was originally habituated (35). On the other hand, when generali- 
zation was assessed by means of repeated stimuli presented at one point following 
habituation at a nearby point, there was evidence of some generalization, since 
there was an apparent savings in the rate of habituation. Because of methodo- 
logical differences, it is difficult to compare various studies of generalization, but 
further studies of this phenomenon would be valuable and could provide insights 
into the mechanism of habituation. 

Consistent with the hypothesis that the plastic changes underlying habituation 
and sensitization of the flexion reflex are located at the level of the interneurons is 
the observation that the responsiveness of interneurons is parallel to the behav- 
ioral responses. Two classes of interneurons have been described (48). One class 
shows a purely decremental response with repeated stimulation, while a second 
class initially increases in responsiveness but eventually decreases. Behaviorally, 
with repeated stimulation the flexion reflex can initially show incrementing re- 
sponses (sensitization) before showing decrement (habituation) (118). Similar 
classes of neurons found in the spinal cord are also found in the reticular forma- 
tion (50), and it is likely that mechanisms of tactile habituation present in the 
spinal cord are also present in the brain. It seems likely that the mechanisms 
operating in lower animals also operate in humans, since on the behavioral level 
habituation and dishabituation of the flexion reflex in humans with spinal transection
(35, 42, 43) are very similar to those of spinal cats. 


Invertebrate Studies 


In order to study synaptic mechanisms most effectively, it is necessary to work with 
a monosynaptic pathway in which it is possible to stimulate the presynaptic unit 
and record from the postsynaptic unit without the mediation of polysynaptic 
pathways. In other words, it is necessary to locate both the presynaptic and 
postsynaptic unit at which the critical change mediating the behavioral plasticity is 
located. This has not yet been possible in the spinal cord, but it has been possible in 
several different invertebrates, including the crayfish, the sea hare Aplysia, and 
the cockroach. Habituation of the gill-withdrawal reflex in the marine mollusk 
Aplysia has been extensively studied by Kandel in collaboration with Carew, 
Castellucci, Kupfermann, and Pinsker (19-22, 78, 99, 100). Habituation in the 
crayfish has been studied by several investigators including Krasne in collabora- 
tion with Bryan, Roberts, Wine, and Woodsmall (72-74b, 131) and by Zucker, 
Kennedy, and Selverston (139, 141). Studies of habituation in the cockroach are 
not as complete as in Aplysia and the crayfish, but several investigators have 
recently made important contributions, including Callec, Guillet, Pichon & Bois- 
tel (18) and Zilber-Gachelin & Chartier (137, 138). The picture that has so far 
emerged is surprisingly consistent, indicating that in all cases habituation is due, at 
least in part, to a decrease of excitatory synaptic efficacy at the central synapses of 
the sensory units in the reflex pathways. 


BEHAVIORAL STUDIES Habituation of escape responses to tactile stimuli has 
proved to be the most easily analyzed. These responses typically habituate rapidly. 
Like higher forms of learning, habituation can be retained for days and even 
weeks. In a very thorough and well-controlled study of habituation of siphon and 
gill withdrawal in Aplysia, Carew, Pinsker & Kandel (21) showed that habituation 
in this animal has several parametric features characteristic of higher forms of 
learning. First, they demonstrated that habituation in Aplysia can be retained for a 
very long period of time. At least 3 weeks after initial habituation experience, 
acquisition of habituation was more rapid than in control animals. Long-term 
habituation was produced by training animals daily with 10 trials a day for 5 days. 
It was also shown that, as is characteristic of higher forms of learning, retention of 
habituation was better following spaced training (10 trials/day for 4 days) than 
after massed training (40 trials in one day). A long-term effect was also seen when 
the spacing between groups of 10 trials was as little as 1½ hours, thus permitting 
long-term habituation to be readily studied in an acute preparation. Wine and 
Krasne (73, 131) have found that the escape response in crayfish also shows 
long-term retention, and full recovery did not occur 24 hours after just 10 trials of 
previous habituation. 

Parallel to the vertebrate flexion reflex, a strong stimulus can produce increased 
responsiveness of either a habituated response (dishabituation) or a nonhabituated 
withdrawal response in Aplysia. Similar to habituation, sensitization of the with- 
drawal reflex of Aplysia can be retained for several weeks (100). Dishabituation 
also occurs in the cockroach (138), but probably does not occur in the escape 
response of crayfish (73). 


PERIPHERAL MECHANISMS Habituation of tail-flip escape responses in the crayfish 
(73, 141) and of the gill-withdrawal reflex in Aplysia (22, 78) does not ordinarily 
involve peripheral changes either of the sensory end organs or of the muscle and 
neuromuscular synapse. A source of confusion in the literature on habituation is 
the fact that under certain specialized conditions, peripheral changes may contri- 
bute to response modifications in invertebrates. Thus, for example, in Aplysia if 
tactile stimuli are repeatedly applied directly to a pinnule of the gill instead of to 
the skin surrounding the gill, the withdrawal response of the pinnule habituates 
even if the central nervous system is removed (81a, 96). Under some circumstances 
these peripheral mechanisms may interact with central factors (86). The nature of 
the peripheral factors is not known, and it is not even clear whether or not the 
peripheral mechanisms involve peripherally located neurons or are mediated by 
direct excitation of muscle. The central mechanisms have proven more amenable 
to cellular analysis, and fortunately peripheral mechanisms can be bypassed by 
utilizing a weak or moderate stimulus applied to the receptive surface surrounding 
the gill (81a) instead of directly to the gill itself. Thus the gill can be involved in two 
distinct reflexes: a gill-withdrawal reflex that habituates via central mechanisms, 
and a pinnule response that habituates via a peripheral mechanism. 

As in Aplysia, peripheral mechanisms may, in principle, contribute to reflex 
habituation in the crayfish, but they probably do not substantially contribute to 
normal habituation of the lateral giant escape reflex. Thus, for example, the 
neuromuscular synapse for one of the motor neurons of the escape response shows 
depression when stimulated at rates of one per minute or less (15). However, the 
decrement that may occur at this synapse appears to be offset by the facilitation 
that occurs at the neuromuscular synapses of other motor neurons involved in the 
escape response (139). 

Habituation of the withdrawal response to light in Pleurobranchaea appears to 
involve sensory adaptation, but central mechanisms are probably also involved 
(33). Finally, there are some data suggesting that a certain degree of sensory 
adaptation of some types of tactile receptors may accompany repeated tactile 
stimulation of cockroaches (137). This sensory adaptation is not sufficient to fully 
explain the response decrement observed in neurons in the central nervous system 
(137, 138), but it presumably does contribute somewhat. 


NEURAL CIRCUITS MEDIATING ESCAPE RESPONSES In order to study the mechan- 
isms of habituation, it was first necessary to study the neural circuitry controlling 
the reflexes that habituate. In Aplysia the motor neurons that innervate the organs 
involved in the defensive withdrawal reflex have been identified (79, 95). A pos- 
sible complication in the neural analysis of molluscan behavior is that molluscan 
“motor neurons” may not connect directly to muscle. It has recently been shown, 
however, that gill motor neurons in Aplysia produce discrete junction potentials in 
gill muscle (21a, 81a). It thus appears very likely that despite the presumed 
existence of an extensive peripheral “nerve net” in the muscles of Aplysia, the gill 
motor neurons make direct connections to muscle. In Tritonia, on the other hand, 
the motor neurons mediating the swimming escape response may not make direct 
connections to muscle (59). 

Gill motor neurons in Aplysia receive both a monosynaptic and polysynaptic 
excitatory synaptic input from the mechanoreceptors that mediate the reflex (17, 
22). The cell bodies of the mechanoreceptors are located within the ganglion 
containing the motor neurons, permitting simultaneous recording and stimulation 
of the sensory neurons and their follower cells (either motor neurons or
interneurons that in turn excite motor neurons). 

In the crayfish, the motor neurons mediating the tactile escape reflex may 
receive direct connections from tactile sensory units (73, 139), but their main input 
comes from the lateral giant fibers, which are a series of electrically connected 
neurons that function as a single fiber (141). The lateral giant fibers receive input 
either directly from sensory units via electrical synapses or indirectly from a small 
number of interneurons that produce excitatory synaptic potentials in the giant 
fibers. These interneurons, in turn, very likely receive a direct (monosynaptic) 
chemical excitatory input from mechanoreceptor units. 

In the cockroach, the neural circuit of the escape reflex has not yet been worked 
out in detail. For many years it was felt that the escape reflex in cockroaches is 
mediated by the giant fibers of the ventral nerve cord, but recently it was shown 
that direct stimulation of the largest giant fibers does not result in an escape 
response (32). 


LOCUS OF PLASTIC CHANGES UNDERLYING HABITUATION AND DISHABITUATION In 
order to study the central neuronal changes mediating habituation, it was neces- 
sary to develop semi-intact preparations in which it is possible to simultaneously 
record behavior and intracellular neural activity. Such semi-intact preparations 
were developed independently for the study of gill withdrawal in Aplysia (79) and 
for the swimming response in Tritonia (129). In the semi-intact preparation in 
Aplysia, it was shown that the short-term decrease of reflex responsiveness of the 
gill was due to progressively fewer spikes elicited in gill motor neurons by repeated 
tactile stimulation. The identified gill motor neuron L7 was studied in detail since 
its action could be shown to account quantitatively for a substantial (30-40%) 
proportion of the total gill contraction elicited by a tactile stimulus (81a). In some 
Aplysia neurons not related to the gill, repeated direct firing by means of depo- 
larizing pulses leads to progressively fewer spikes (80, 116a), but this is not the case 
for gill motor neurons (21a). Thus the decreased firing of L7 during habituation is 
not due to decreased excitability of the neuron. Instead, decreased firing was 
shown to be due to a progressive decrease in the size of the complex excitatory 
synaptic potential that is evoked in L7 by tactile stimuli applied to the skin of the 
animal. In order to study the mechanism of the decreased synaptic input, a 
simplified preparation was used in which it was possible to record the monosyn- 
aptic potentials that mechanoreceptor neurons produce on interneurons and mo- 
tor neurons (22). These monosynaptic potentials progressively diminish in size 
when repeatedly elicited at low rates (22), indicating that at least part and perhaps 
all of the reflex decrement is due to a change occurring at the primary afferent 
synapse. When another afferent pathway was stimulated strongly, the same synapse
that showed decrement showed facilitation. Thus neural changes mediating 
both habituation and dishabituation in Aplysia were shown to have the same locus 
in the nervous system. In addition, changes at that locus also contribute to the 
phenomenon of sensitization, since dishabituation in Aplysia is a special case of 
sensitization (19). 

In crayfish, behavioral decrement was shown to be associated with a decrement 
of the later component of a two-component excitatory synaptic potential at the 
lateral giant fiber (72). The later synaptic component is a complex synaptic po- 
tential composed of individual potentials produced by several interneurons that 
are excited by tactile afferents (141). Further analysis showed that during habit- 
uation there was a decrease in the number of these interneurons fired by the 
afferent input (72, 139). The decreased firing of the interneurons in turn was due to 
a decrease of the monosynaptic excitatory synaptic input from mechanoreceptor 
afferents (140). Thus, both in crayfish and Aplysia decrement occurs at the first 
central synapses of the primary receptors. 

In the cockroach, the behavioral significance of neurophysiological studies is
less certain than in the crayfish or Aplysia, but there is evidence that habituation of
the escape response is due to decrement of excitatory synaptic potentials at the 
central synapses of the primary afferent fibers (18, 137), as well as higher order 
synapses (138). Unlike Aplysia, dishabituation in the cockroach may not involve 
the primary afferent synapse, but may operate only at higher levels (138). 

MECHANISM OF SYNAPTIC CHANGE DURING HABITUATION Since in the examples 
discussed, synaptic decrement occurs in a monosynaptic pathway in which only a 
single presynaptic element is stimulated, it is virtually certain that the decrement 
represents a depression of excitatory synaptic transmission rather than active
inhibition. Although presynaptic inhibition can occur at the central terminals of 
the mechanoreceptor neurons in the crayfish (74), such presynaptic inhibition does 
not mediate synaptic decrement during behavioral habituation. Instead, presyn- 
aptic inhibition appears to be active only during tail flips, and thereby prevents 
excessive decrement of the central mechanoreceptor synapses. 

Since habituation in invertebrates is not due to an active inhibitory process, it is 
unlikely that dishabituation is due to the blocking of inhibition. As was suggested 
in vertebrates (115a), dishabituation appears to be an independent facilitatory 
process (19, 22). The most parsimonious explanation of dishabituation in Aplysia 
is that it results from a type of presynaptic facilitation (67), although direct data 
are lacking. 

A major question concerning the mechanism of the decrement of excitatory 
synaptic potentials during habituation is whether the decrement is due to presynaptic
or postsynaptic factors. A presynaptic cause of decrement implies that
synaptic decrement is due to progressively less transmitter substance being re- 
leased for each presynaptic spike. Postsynaptic causes of synaptic decrement 
imply that there is a constant amount of transmitter release with repeated pre- 
synaptic spikes, but that the transmitter becomes progressively less effective in 
depolarizing the postsynaptic cell. Postsynaptic factors could include such things 
as changes in the resistance of the postsynaptic cell or desensitization of the 
postsynaptic receptors that react with the transmitter substance. Evidence from 
neuromuscular junctions and other model synaptic systems exhibiting synaptic 
decrement indicates that decrement usually results from a presynaptic process that 
operates to reduce transmitter output with repeated release (see p. 379). At cer- 
tain synapses, synaptic decrement (of unknown behavioral significance) may also 
be due to a postsynaptic process involving receptor desensitization (31, 122). In 
both the crayfish (140) and Aplysia (21b), however, the available data indicate that 
behaviorally relevant decrement is due to decreased transmitter release rather 
than receptor desensitization. This conclusion was made on the basis of a quantal 
analysis of the decremented and nondecremented excitatory synaptic potential 
(see p. 378 et seq on model studies for a discussion of quantal analysis). Because of 
technical difficulties in the crayfish (140), it was not possible to apply the most 
direct and powerful methods of quantal analysis. The method that was used 
(variance method) is based upon an analysis of fluctuations of the size of the 
synaptic potential, and this method is valid only if a number of underlying
assumptions are applicable. Unfortunately, it was not possible to directly assess
whether these assumptions were indeed applicable. Nevertheless, the data obtained
were consistent with the interpretation that the decrement was due to the
tained were consistent with the interpretation that the decrement was due to the 
release of less transmitter per presynaptic spike, i.e. there was a decrease of 
quantal content. 
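
The logic of the variance method can be summarized in a short derivation. This is only a sketch under assumptions that, as noted above, could not be checked directly in the crayfish: release is taken to be Poisson, every quantum is taken to produce the same voltage q, quanta are taken to sum linearly, and recording noise is ignored. The symbols \bar{V} and \sigma_V^2 (mean and variance of the synaptic potential) and CV (coefficient of variation) are notation introduced here for illustration.

```latex
\begin{align*}
\bar{V} &= m\,q, & \sigma_V^{2} &= m\,q^{2} \\
\mathrm{CV} &= \frac{\sigma_V}{\bar{V}} = \frac{1}{\sqrt{m}}, &
m &= \frac{1}{\mathrm{CV}^{2}} = \frac{\bar{V}^{2}}{\sigma_V^{2}}
\end{align*}
```

Under these assumptions a presynaptic decrement (m falls, q constant) raises the coefficient of variation of the potential, whereas a purely postsynaptic decrement (q falls, m constant) leaves it unchanged.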

A similar conclusion was reached in preliminary studies of the sensory-to- 
motor-neuron synapse that mediates habituation of the gill-withdrawal reflex in 
Aplysia (21b). At this synapse it was possible to apply more direct techniques for 
assessment of quantal content, and it could be shown that repeated presynaptic 
spikes released progressively fewer quantal units (m) of transmitter. In some experiments
the magnesium ion concentration in the bathing solution was raised in
order to decrease the release per presynaptic spike to an average of only one or a 
few quanta. Under these conditions, during successive presynaptic spikes, there 
was no change of the size of the voltage change produced by individual quanta (q),
but there was an increase in the number of failures (lack of any release). An 
increase of failures argues very strongly against a postsynaptic mechanism of 
depression, such as receptor desensitization. 
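
The reasoning behind the failure count can be illustrated with a small simulation. The sketch below assumes Poisson release, for which the probability of a failure is exp(-m), so that m can be estimated as ln(trials / failures). The function names, the unit potential of 0.1 mV, and the two values of m are hypothetical choices made for illustration; they are not data from the studies cited.

```python
import math
import random


def poisson_sample(mean, rng):
    """Draw one Poisson-distributed count (Knuth's method; adequate for small means)."""
    limit = math.exp(-mean)
    count = 0
    product = rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def simulate_potentials(m, q, trials, rng):
    """Simulate synaptic potentials (mV): each trial releases a Poisson number of quanta of size q."""
    return [poisson_sample(m, rng) * q for _ in range(trials)]


def quantal_content_from_failures(potentials):
    """Estimate m from the fraction of failures, using P(failure) = exp(-m)."""
    failures = sum(1 for v in potentials if v == 0)
    if failures == 0:
        raise ValueError("no failures observed; the method requires low release")
    return math.log(len(potentials) / failures)


if __name__ == "__main__":
    rng = random.Random(0)
    q = 0.1  # hypothetical unit potential in mV, held constant throughout
    for label, m in (("before decrement", 2.0), ("after decrement", 0.5)):
        potentials = simulate_potentials(m, q, 20000, rng)
        failure_fraction = potentials.count(0.0) / len(potentials)
        estimate = quantal_content_from_failures(potentials)
        print(f"{label}: failure fraction {failure_fraction:.3f}, estimated m {estimate:.2f}")
```

In this sketch the unit potential never changes, yet failures become much more frequent as m falls. A decrement produced only by a smaller q would shrink every response without adding failures, which is why an increase of failures points to a presynaptic change.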

The analysis of the synaptic mechanisms underlying habituation has thus far
been limited to short-term habituation. For reasons that are not understood,
habituation obtained in acute crayfish preparations exhibits only very brief 
retention, even though habituation of the same response in intact crayfish exhibits 
long-term retention of greater than 24 hours (73). On the other hand, electro- 
physiological evidence from the isolated abdominal ganglion of Aplysia indicates 
that this preparation can exhibit retention of synaptic decrement at gill motor 
neurons for a period of at least 24 hours (20). It should therefore prove possible to 
analyze the mechanism of long-term habituation of withdrawal reflexes in Aplysia 
and to gain important insights into the relationship of short-term to long-term 
storage processes. 


ASSOCIATIVE LEARNING 
Vertebrate Studies


Research on the mechanisms of associative learning has been done primarily on 
vertebrate preparations. In many invertebrates it has not been possible to dem- 
onstrate associative learning, and those invertebrates that exhibit the clearest 
examples of associative learning by and large do not have the most favorable 
nervous systems for study. 

The great majority of studies of vertebrate associative learning have not been
concerned with cellular mechanisms of learning and consequently will not be 
reviewed in detail. These studies try to trace the flow of information from one 
neural structure to another during the course of acquisition, retention, and ex- 
tinction. In an attempt to separate secondary changes of neuronal activity from 
primary changes, a number of studies from the laboratory of Olds (e.g. 70, 109) 
have looked at those changes of single unit activity that occur with the shortest
latency following the presentation of a conditioned stimulus. The interpretation of
findings utilizing this method is highly dependent upon detailed anatomical 
knowledge, not only of what is connected to what, but also of the conduction 
velocity of the interconnecting pathways. At present such anatomical knowledge is
not sufficient to provide unambiguous interpretation of experiments that utilize 
latency information to discriminate primary from secondary neural changes. 

An important series of recent studies by Woody and his collaborators (134, 135)
has attempted to analyze the cellular mechanisms of classical conditioning in the
cat. Although these studies constitute one of the most promising approaches 
toward identifying the nature of the plastic changes underlying associative learn- 
ing in vertebrates, because of statistical and methodological problems it is imper- 
ative that these studies be independently confirmed. 

A key element in the studies of Woody and collaborators is the physiological 
identification of the single units that are studied. By means of microstimulation 
through a microelectrode, cells (or small populations of cells) were classified as 
belonging to different motor systems. Confirming previous findings by others, 
closely spaced neurons within a single cortical electrode tract were found to project 
(polysynaptically) to distinctly different motor fields. In groups of cats classically 
conditioned with one or another response as the unconditioned response, there 
was a corresponding shift in the prevalence and in the threshold for activation of 
the type of cell whose motor projection corresponded to the conditioned response. 
Thus, for example, in cats trained to eyeblink (unconditioned stimulus, glabella 
tap) in response to a click (conditioned stimulus), a click stimulus activated more 
cells that projected to eyeblink muscles than to nose muscles, and vice versa for 
cats trained to exhibit a nose twitch to a conditioned stimulus. Thus, after condi- 
tioning, within a population of intermingled cells the conditioned stimulus tended 
to excite a specific subpopulation of cells more than other subpopulations. A sig- 
nificant finding from these studies was the observation that although with condi- 
tioning there was an increase in the number of motor-specific cells that responded 
to the conditioned stimulus, there was no consistent increase in the average 
number of spikes elicited per unit. Thus it was concluded that a response occurs 
when the total number of units activated by a stimulus exceeds a certain value 
(135). 

In addition to an increase in the number of responsive motor-specific cells in 
conditioned cats, Woody & Engel (135) found that there was a decrease in the 
electrical threshold for activation of cells projecting to the motor field of the 
conditioned response. Since electrical stimulation of the brain can excite cells 
either directly or synaptically by exciting presynaptic fibers, these experiments 
cannot determine whether the threshold change is due to presynaptic or postsyn- 
aptic changes at the neuron. 

In order to explore the nature of threshold changes during learning, Woody & 
Black-Cleworth (134) utilized intracellular recording techniques in an attempt to 
directly measure the threshold and input resistance of cells projecting to the motor 
field of the conditioned response. They found that in cats trained to eyeblink, eye 
projection motor cells had significantly lower threshold for firing than did nose
projection cells. Since in this case the electrode was intracellular, the threshold was
measured by current limited to the stimulated cells, and it is therefore very likely 
that the threshold change represents a change in the neuron itself rather than a 
change in its presynaptic input. This represents the first systematic study which 
claims to have demonstrated a postsynaptic locus for a plastic change associated 
with learning, and as such deserves very careful scrutiny. Unfortunately, this study 
did not include either a nonconditioned control group or a group conditioned to 
make a different conditioned response. Consequently, it is impossible to unequiv- 
ocally eliminate the possibility that neurons that project to eye muscles are 
different from other cells, independent of whether the cat is conditioned or not. 


Invertebrate Studies 


INSECTS In 1962, Horridge reported that it was possible to train headless cock- 
roaches to position a leg so as to minimize an electric shock that was given 
whenever the leg dropped below some preset level. This type of learning appeared 
to be highly promising for neural analysis. The learning could occur even when all 
of the central nervous system was removed except for a single ganglion in the 
ventral nerve cord. Following the initial reports of learning of leg position in 
insects, there were a number of behavioral and physiological studies of this phe- 
nomenon (reviewed by Eisenstein 37), but progress in understanding the neural 
mechanisms at a cellular level has been disappointing. Hoyle & Burrows (58),
however, have recently reported a very promising development. They have iden- 
tified the locust motor neuron that was previously shown to undergo modification 
of tonic firing rate contingent upon an operant conditioning regimen (57). This 
neuron, the tonic motor neuron of the anterior adductor of the coxa (AAdC), could
be impaled by intracellular electrodes for morphological and electrophysiological 
studies. By means of dye injection techniques, they studied the morphology of the 
AAdC (16). Unlike the other motor neurons studied in the metathoracic ganglion, 
this cell was devoid of branches on a particular segment of its axonal process in the 
neuropile. Intracellular recordings from the AAdC suggested that the spike acti- 
vity of this cell is not driven one-for-one by an underlying excitatory synaptic 
potential. Rather, the cell shows tonic activity that appears to be modulated by a 
background level of synaptic activity. These data indicate that modification of the 
firing frequency of this cell could be due to either a change in its tonic level of 
excitatory or inhibitory synaptic input, or to a change in the firing threshold of the 
cell. Tosney & Hoyle (120) have reported a new automated operant training 
technique for producing alterations in the firing rate of the AAdC. 


GASTROPOD MOLLUSKS The gastropod mollusks are ideal for cellular neurophys- 
iological study, but up until very recently there was no convincing evidence that 
they are capable of associative learning. Widely quoted studies of maze learning or 
classical conditioning in snails (see Willows 130 for review) on close examination 
reveal a total lack of appropriate controls for nonassociative forms of learning. Far 
better controlled than these early reports are recent studies of conditioning of 
feeding responses in the marine mollusk Pleurobranchaea (33) or the pulmonate
slug Limax (Gelperin, submitted for publication). It is nevertheless still premature
to conclude that associative conditioning has been unequivocally demonstrated. 
These recent studies have not yet received independent replication, and addition- 
al controls would be desirable. In at least one instance, an attempt to replicate (77) 
a study of conditioning of feeding in Aplysia (63) was unsuccessful. In addition, 
some of the parametric features of conditioning in gastropod mollusks are very 
different from what is generally considered to be characteristic of conditioning in 
vertebrates. For example, conditioning in Pleurobranchaea (33) occurs even when 
there is zero time interval between the conditioned stimulus (tactile stimulation 
with a glass rod) and the unconditioned stimulus (extract of squid juice). The glass 
rod was dipped in the squid extract and then rubbed on the oral veil of the animal. 
Unfortunately, this technique of simultaneous presentation of the conditioned 
and unconditioned stimulus makes it impossible to do a control experiment in 
which an identical unconditioned stimulus can be presented unpaired with the 
conditioned stimulus. 

There is a preliminary report of neural correlates associated with conditioning 
of feeding responses in Pleurobranchaea (33). Although promising, these studies 
are not yet complete enough to permit any conclusions. 


PRELUDES TO ASSOCIATIVE CONDITIONING Several different features of nonasso- 
ciative behavioral modification in gastropod mollusks could provide an insight 
into associative forms of learning. Sensitization can be demonstrated readily in 
mollusks, and it has been argued that sensitization may be an evolutionary precursor
of associative conditioning (124).

Two features of habituation in Aplysia that may be relevant to associative learn- 
ing are greater retention with spaced compared to massed training (21), and less 
rapid habituation with strong compared to weak stimuli (99). These features may 
represent primordial elements of associative conditioning. It may well be that 
detailed understanding of the temporal and spatial features of nonassociative 
forms of learning will provide important clues to the mechanism of associative 
learning. 


MODEL STUDIES OF NEURONAL PLASTICITY 


Alterations in behavior presumably involve neuronal plasticity, or the property of 
neurons to undergo functional alterations. Until recently all of our knowledge of 
neuronal plasticity came from studies of “model” systems that were not involved 
in actual behavioral changes, or whose behavioral relevance was unknown. Al- 
though it is now possible in a few instances to analyze the mechanisms of neu- 
ronal plasticity known to mediate actual changes of behavior, model studies of 
plasticity still constitute an important source of data relevant to the neurophysio- 
logy of learning. Model studies help define the biologically meaningful boun- 
daries for theoretical treatments of the neurophysiology of learning. In addition, 
studies of experimentally favorable model systems, such as neuromuscular junctions,
can provide the basic concepts and techniques for the study of less experimentally
favorable systems that are behaviorally relevant. There is an unfortunate
tendency for some authors to utilize behavioral terms such as habituation to 
describe neurophysiological processes such as synaptic depression (e.g. 15, 137), 
thereby blurring the distinction between studies of established behavioral significance
and model studies that have only theoretical significance.


Protozoa


Because of the great differences of the behavioral integrative systems between 
protozoans and metazoans, it can be questioned whether behavioral modification 
in protozoans should be termed learning. Protozoans, however, exhibit response
decrement (5, 93, 133) similar to metazoan habituation, and they may prove useful 
as model systems for what may occur at vertebrate synapses during habituation. 

There is good evidence that response decrement in protozoa is not due to fatigue 
of the motor apparatus (5, 133). Decrement may be due to the waning of some 
process associated with sensory transduction. By means of intracellular recording 
from Stentor, Wood (132) found that repeated mechanical stimulation evokes a 
progressively smaller prepotential. The prepotential ordinarily appears to trigger 
an action potential and associated movement. As the prepotential becomes 
smaller, it appears to no longer trigger an action potential and contraction fails to 
occur. 

Studies on the biophysical or biochemical basis of response decrement in pro- 
tozoa have largely yielded negative results. For example, there was no detectable 
increase or decrease of the total quantity of inorganic ions in unstimulated or 
stimulated protozoa (3). Similarly, alteration of protein synthesis does not appear 
to be associated with response decrement (2) and, in fact, decrement still occurs 
in cell ghosts of animals that have had their macronuclei removed (6). Apple- 
white & Gardner (3) have postulated that response decrement in protozoa is not 
due to an energy-consuming process, since the rate of decrement is constant 
over a wide temperature range. They suggest that alterations of the distribu- 
tion of magnesium or calcium stores within the organism may underlie response 
decrement. 

Despite continued attempts to demonstrate associative behavioral modifications
in protozoa, there are still no convincing examples that have been generally
accepted (see e.g. 4, 51). 


Synaptic Plasticity 


QUANTAL ANALYSIS At one time, studies of synaptic plasticity were largely of 
academic interest to psychologists interested in mechanisms of learning. The 
demonstration that synaptic plasticity can indeed mediate behavioral modifica- 
tions gives renewed significance to model studies of synaptic function, and it is 
becoming increasingly clear that psychologists interested in mechanisms of 
learning must understand synaptic physiology. An excellent elementary treatment 
of synaptic function is provided by Katz (67a), and several more technical reviews 
are also available (e.g. 60, 61, 76, 88a). To a psychologist, the literature on synaptic 
plasticity may appear highly complex and technical, but much of this literature
can be understood on the basis of a few elementary concepts regarding the
“quantal theory” of Del Castillo & Katz (34). Quantal theory provides the
concepts and theoretical tools for the physiological analysis of synaptic function
and modification of synaptic function. The basic postulate of quantal theory is that
the transmitter substance at the presynaptic terminal of a synapse is released in
discrete packets. A synaptic potential of size V millivolts is composed of a
number of quantal units each of which contributes q millivolts, that is, V = mq.
The release of each quantal unit is a statistical process. In the resting state the
probability of release (p) is very small, but invasion of the terminal by an action
potential very briefly results in a large increase of p.

Empirically it has been found that at many synapses, for relatively low levels
of release the statistical properties of transmitter release follow a Poisson or close
to Poisson distribution. The discovery that the synapse contains morphologically
discrete synaptic vesicles led to a hypothetical identification of the constants of the
Poisson equation with real entities. One notion is that if there is a population of n
vesicles that can be readily released, and each vesicle has a probability p of being
released during an action potential, then m (the number of quanta released per
action potential) = pn. The value of m will vary from spike to spike since it is
determined by a probabilistic constant p. The frequency distribution of repeated
values of m will be Poisson if p is not too large, n is very large and the releases of
quanta are independent events. A variety of methods exist for estimating the
values of q, m, n, and p at synaptic terminals, and an important thrust of the
analysis of synaptic plasticity has been an attempt to characterize altered synaptic
potentials in terms of alterations of these parameters.
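
The condition that n be large and p small can be checked numerically. The following sketch compares the binomial distribution of released quanta (n vesicles, each released independently with probability p) with a Poisson distribution of the same mean m = pn. The parameter values are hypothetical and are chosen only to contrast a small vesicle pool with a large one.

```python
import math


def binomial_pmf(k, n, p):
    """Probability that exactly k of n independent vesicles are released."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def poisson_pmf(k, m):
    """Poisson probability of k quanta when the mean quantal content is m."""
    return math.exp(-m) * m**k / math.factorial(k)


def max_abs_difference(n, p, k_max=20):
    """Largest gap between the binomial and Poisson probabilities for k = 0..k_max."""
    m = n * p
    return max(
        abs(binomial_pmf(k, n, p) - poisson_pmf(k, m))
        for k in range(min(n, k_max) + 1)
    )


if __name__ == "__main__":
    # Both cases have the same mean quantal content, m = 2.
    for n, p in ((4, 0.5), (1000, 0.002)):
        print(f"n={n}, p={p}: largest discrepancy {max_abs_difference(n, p):.4f}")
```

With the same mean of two quanta per spike, the small pool with a high release probability departs clearly from the Poisson form, while the large pool with a low release probability is nearly indistinguishable from it. Estimates of m that rely on Poisson statistics are therefore safest when release has been lowered experimentally.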

With rare exceptions, during the decade following the first description of the
quantal hypothesis, it was found that both increases as well as decreases of
synaptic efficacy were due to a presynaptic factor, i.e. a change in m, reflecting
a change in the amount of transmitter released (61). As described earlier, quantal
analysis has been used to help establish that habituation in crayfish and Aplysia is
due to a decrease of quantal content (m) at the synapses of primary afferent fibers.
During the past 5 years, in model studies of synaptic plasticity, there have been
many attempts to relate changes in m to a change of either the probability of
release (p) or the number of releasable quanta (n). A number of investigators have
concluded that both increases and decreases of successively evoked synaptic
potentials can be due to changes of either p or n or both (9, 25, 125).

Complicating the analysis of synaptic plasticity in terms of changes in p and n
is the suggestion that p and n may not be so readily translated into simple
representations of a quantity of synaptic vesicles and the probability of release.
For example, p may represent the combined probability of a vesicle moving to a
release site and the probability of release from that site (140).


CALCIUM AND INCREASED TRANSMITTER RELEASE Of the numerous hypotheses
put forth to explain the mechanism of alterations of transmitter release that occur
following preceding release, the most viable has been the suggestion of Katz &
Miledi (68) to explain facilitation or increased transmitter release for a period of up
to approximately 250 msec following a single presynaptic spike. They suggested
that facilitation is the result of residual calcium that remains in the presynaptic
terminal following the initial inflow of calcium during the first presynaptic spike.
This hypothesis is based in part on the finding that at the vertebrate neuromus- 
cular junction the amount of transmitter release per presynaptic impulse is related 
to the fourth power of the external (and presumably internal) calcium concentration
(36, but see also 29, 92). An exponential relationship between internal calcium
concentration and transmitter release would mean that small changes of the 
concentration of calcium in the presynaptic terminal would produce large changes 
in transmitter release. 
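
The steepness of a fourth-power relation can be made concrete with a short calculation. The sketch below treats release R as strictly proportional to the fourth power of the calcium concentration and ignores saturation; the symbols R and [Ca] are notation assumed here for illustration, not taken from the studies cited.

```latex
\begin{align*}
R &\propto [\mathrm{Ca}]^{4}
  \quad\Longrightarrow\quad
  \frac{R_2}{R_1} = \left(\frac{[\mathrm{Ca}]_2}{[\mathrm{Ca}]_1}\right)^{4} \\
\frac{[\mathrm{Ca}]_2}{[\mathrm{Ca}]_1} &= 1.10
  \quad\Longrightarrow\quad
  \frac{R_2}{R_1} = 1.10^{4} \approx 1.46
\end{align*}
```

On these assumptions a 10 percent rise of calcium at the release site would increase release by about 46 percent, and a 10 percent fall would reduce release to about 66 percent of its initial value (0.90 to the fourth power is approximately 0.66).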

The basic observation in support of the residual calcium hypothesis is that the 
amount of calcium present at the terminal during the first presynaptic spike 
determines the degree of increase of the amount of transmitter released by the 
second presynaptic spike. Both an early (68) as well as later stage of facilitation 
(136) can be accounted for by the residual calcium hypothesis. It has been speculated
that because of the great affinity of mitochondria for calcium, mitochondria
may regulate neuronal levels of calcium (83) and may thereby help terminate trans- 
mitter release following the inrush of calcium at the presynaptic terminal (103). 

A role for calcium in mediating the relatively long lasting potentiation following 
a train of presynaptic spikes (post-tetanic potentiation) has been suggested on the 
basis of experiments similar to those studying facilitation. Increasing the external 
calcium ion concentration present during tetanic stimuli increases the duration of 
the following post-tetanic potentiation (105, 123). The maximum degree of po- 
tentiation, however, is not much affected, and there does not appear to be a good 
quantitative fit between observed potentiation and potentiation to be expected on 
the basis of the calcium ion concentration present during the tetanic stimulation. 

It has been claimed recently that synapses in the hippocampus can show very 
prolonged potentiation following stimulation of the perforant pathway (11, 12). 
Prolonged potentiation, in the range of hours, has also been reported at certain 
neuromuscular synapses in crustaceans (111) and at the synapse between the lateral 
column fibers and motor neurons in the frog (38). The role of calcium in very 
prolonged post-tetanic potentiation has not been studied. 

As well as calcium directly regulating the amount of transmitter release, there is 
evidence that calcium may regulate the rate of synthesis of transmitter agents. 
Banks, Magor & Mraz (8) have found that reducing the calcium ion concentration 
leads to a decreased rate of synthesis of noradrenalin. 


SYNAPTIC DECREMENT AND TRANSMITTER DEPLETION Until recently, most exper- 
imental evidence suggested that synaptic decrement was related to a depletion of n 
or the readily available store of transmitter quanta. For example, as would be 
predicted by the depletion hypothesis, at the vertebrate neuromuscular junction 
the degree of depression is directly related to the amount of transmitter released 
per presynaptic spike (117). 

The first evidence suggesting that simple depletion of transmitter stores cannot 
explain synaptic decrement came from a study of Kuno (75) on the monosynaptic 
excitatory connection between muscle afferents and motor neurons in the spinal 
cord. Previous studies indicated that this synapse shows decrement at rates of
stimulation as low as one every 10 seconds (84). If activity of the synapse led to
transmitter depletion, it might be expected that the quantal content of each 
afferent terminal would be relatively large or n would be small. A quantal analysis 
of this synapse revealed that the actual quantal content per afferent terminal is 
very low, each fiber releasing an average of only a single quantum per spike (75). 
Furthermore, the size of the second of a pair of evoked synaptic potentials was not 
related to the size of the previous potential. An inverse relationship would be 
expected if n was small and the transmitter store was easily depleted. It would be of
interest to see whether the dorsal column synapse onto motor neurons also has 
properties similar to the muscle afferent synapse. The dorsal column synapse 
recently has been found to show decremental responses at low rates of stimulation
and, similar to the muscle afferent synapse, the decrement cannot be explained
by an active inhibitory process (38).

Recent data on two other rapidly decrementing synapses also suggest that a 
simple decrease of n cannot explain synaptic decrement. Bruner & Kennedy (15) 
have found that the depression of synaptic efficacy at the giant-to-motor synapse 
in crayfish is unaltered (as a percent of initial value) even when transmitter release 
per presynaptic spike is greatly reduced by raising the magnesium ion concentra- 
tion in the bathing solution. Similar results have been obtained in preliminary 
studies of the behaviorally relevant decrementing synapse in Aplysia (21b). 
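
Why these magnesium experiments bear on the depletion hypothesis can be seen from a minimal depletion model. The sketch below is a deterministic simplification assumed here for illustration, not a model taken from the studies cited: each spike releases a fixed fraction p of the available store, and an optional fraction of the deficit is refilled between spikes. The store size, the values of p, and the function names are hypothetical.

```python
def depletion_responses(n0, p, spikes, recovery=0.0):
    """Mean quanta released on each spike when release draws down a finite store.

    n0       -- initial number of releasable quanta
    p        -- fraction of the available store released by each spike
    recovery -- fraction of the deficit refilled between successive spikes
    """
    available = float(n0)
    released = []
    for _ in range(spikes):
        m = p * available
        released.append(m)
        available -= m
        available += recovery * (n0 - available)
    return released


def relative_decrement(responses):
    """Last response expressed as a fraction of the first."""
    return responses[-1] / responses[0]


if __name__ == "__main__":
    # Normal release versus release lowered tenfold (as by raised magnesium).
    for label, p in (("normal release", 0.5), ("reduced release", 0.05)):
        responses = depletion_responses(n0=1000, p=p, spikes=5)
        print(f"{label}: fifth response is {relative_decrement(responses):.3f} of the first")
```

In this model, lowering release per spike greatly lessens the decrement expressed as a fraction of the initial response. The finding that the percentage decrement is unaltered when release is reduced is therefore the opposite of what simple depletion predicts.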

Despite the recent data, the actual mechanism of transmitter release is not well 
enough understood to completely rule out the possibility that synaptic decrement 
is due to a decrease of the amount of available transmitter, but it is becoming 
increasingly attractive to consider that decrement, especially at rapidly decre- 
menting synapses, is due to a decrease of the probability of release of a constant 
store of transmitter. An attractive feature of this hypothesis is that both synaptic 
increment and synaptic decrement could be explained by a similar mechanism 
involving alteration of presynaptic calcium levels. Just as increased presynaptic 
levels of calcium ions could, in principle, explain increased transmitter release, a 
decrease of presynaptic calcium levels could explain synaptic depression. 


CYCLIC ADENOSINE MONOPHOSPHATE Cyclic adenosine monophosphate (cAMP) 
has been identified as a substance that mediates many different cellular events 
triggered by external stimuli (46), and as such it is natural that it should figure in 
speculation regarding mechanisms of learning. In one symposium concerned with 
the role of cAMP in the nervous system, no less than four papers concluded by 
speculating that cAMP may be involved in both short-term and long-term mem- 
ory (13, 47, 54, 82). Several different transmitters have been shown to stimulate 
neuronal production of cAMP (46), and there is reason to believe that cAMP could 
have effects on the genetic readout of the cell (82), as well as on membrane 
properties (104). Particularly intriguing are the data suggesting that epinephrine 
may increase the release of acetylcholine at the neuromuscular junction by sti- 
mulating increased presynaptic levels of cAMP (112). On the basis of analogy to 
other systems, it has been suggested that cAMP may affect neuronal function by
altering membrane permeability or binding to calcium (104). cAMP could affect
movement of calcium into organelles within the terminal such as microsomal vesicles
(14, 112) or mitochondria. Alternatively, it might regulate movement of calcium
across the membrane, out of presynaptic terminals into the extracellular space.


THEORETICAL MODELS The basic features of classical or operant conditioning can 
be realized by means of elementary neuronal plastic properties combined with 
very simple connectivity patterns of the constituent neurons (e.g. 44, 81). Several 
authors have pointed out that by means of appropriate circuitry, conditioning can 
result from a decrease as well as increase of synaptic effectiveness (44, 56, 81, 107). 
Speculation regarding the role of decreased synaptic effectiveness in learning is of 
added interest given the fact that at present the only documented neuronal 
mechanism of a behavioral change is a decrease of synaptic efficacy (see pp. 
373-74). Further, many of the trophic interactions between neurons, as discussed 
below, involve decreases of synaptic effectiveness or connectivity. 


TROPHIC INTERACTIONS AND CONTINGENT ALTERATIONS OF SYNAPTIC INTER- 
ACTION Since associative learning involves alteration of behavior as a func- 
tion of some contingent relationship between stimuli or responses, researchers 
have been interested in whether or not long-term changes of synaptic effectiveness 
of a presynaptic element can be contingent upon the activity either of the post- 
synaptic unit or of other presynaptic terminals on the same postsynaptic element. 
A number of experiments suggest that contingent modifications of synaptic function
are possible, but in no case have such modifications been causally related to
learning.

One intriguing possibility of an interaction between presynaptic and postsynaptic 
activity arises from the observations of Lomo & Rosenthal (85) and Cohen & 
Fischbach (27). These workers have found that lack of activity of muscle leads to a 
spread of acetylcholine sensitivity over the whole muscle membrane, and that 
conversely, activity of the muscle results in shrinkage of active receptor areas to the 
region of the end plate. Thus the spread of receptor sensitivity following dener- 
vation of muscle or neurons (supersensitivity) may not be due to the removal of a 
trophic substance contained in the nerve, but rather may be due to lack of activity 
of the postsynaptic element. Stent (116) has speculated that reversal of the sign of 
the membrane potential during the action potential of a neuron (or muscle) 
“destroys” receptors on the surface, but that the postsynaptic receptors immedi- 
ately under the region of terminals that have been active are spared because this 
region is not as severely depolarized by the postsynaptic action potential. This 
mechanism could provide the physical basis of the Hebb hypothesis of associative 
learning. Only those synapses that have participated in firing the postsynaptic cell 
would remain effective, whereas synapses not involved in firing the neuron would 
become ineffective due to the disappearance of active receptors. 

Although there are data that suggest that the spread of receptor sensitivity 
following denervation is not due to the removal of a trophic substance (as was 
previously believed), some experiments suggest that under certain conditions lack 
of a trophic substance in nerve may also result in spread of active receptors (64).
Harris has recently reviewed the trophic functions of nerve cells (52). 

Several experiments suggest that a trophic substance in one nerve terminal may 
affect the spatial distribution of other terminals. Aguilar et al (1), for example, 
have found evidence consistent with the hypothesis that axoplasmic flow within a 
neuron continually provides the terminal with a substance that diffuses out and 
functions to prevent the invasion of nearby terminals. When axoplasmic flow of a
nerve was blocked, the sensory field of the nerve appeared to be “invaded” by the
terminals of nearby nerves. Marotte & Mark (88) have reported evidence (care- 
fully confirming older findings) that one synaptic terminal on a muscle fiber can 
cause other terminals on the same muscle fiber to become nonfunctional. Nerves 
innervating eye muscles of fish were sectioned and then permitted to regenerate. 
When a nerve not normally innervating a muscle made synaptic contact, the 
muscle showed reflex activity appropriate for that nerve but inappropriate for its 
normal function. Shortly after the muscle became innervated by its normal nerve, 
the abnormal reflex disappeared and was replaced by normal reflex activity. A 
reasonable interpretation of this result is that inappropriate synapses can function, 
but are made nonfunctional by more appropriate synaptic connections. 

There is increasing evidence that in the central nervous system the existence of 
one set of synaptic terminals on a neuron prevents the invasion of other terminals, 
and that if the normal input to a neuron is removed or prevented from forming, 
terminals from other systems will fill in the area devoid of its normal synaptic 
terminals (e.g. 87, 91, 103a, 108). On the basis of electrophysiological (87) and 
behavioral evidence (108), it appears that such abnormal connections are func- 
tional. 


MORPHOLOGICAL CHANGES Up until approximately 5 years ago, there were very 
few reports of synaptic morphological changes due to experience or synaptic use. 
Recently a large number of studies have appeared reporting various types of 
morphological changes. One group of studies has examined short-term changes 
(lasting on the order of minutes) as a consequence of relatively brief periods of 
synaptic activity. Another group of studies, usually done with young animals 
whose nervous system is in the process of development, has examined long-term 
changes (lasting days or weeks) in response to injury or long-term exposure to 
environmental stimuli. 


Short-term changes The most dramatic short-term morphological changes in 
synaptic terminals are the result of very high levels of stimulation (23, 41, 65, 71,
89, 97) or conditions in which the metabolic activity of the presynaptic terminal is
impaired (7, 65, 94). But changes have also been reported under more physiolog- 
ical conditions (101, 102). Alterations reported include increases or decreases of 
the number of synaptic vesicles and their volume (65, 71, 89, 97, 102), alterations of 
the morphology and position of nerve-terminal mitochondria (65), increases of the 
amount of external membrane at the terminal (53, 101), and increases of the size of 
synaptic boutons (62). The data on ultrastructural changes at synapses following
use provide added support for the notion that transmitter release consists of a 
process in which synaptic vesicles fuse with the presynaptic terminal and release 
their contents into the synaptic cleft. Although there can be morphological 
changes at synapses following use, it is by no means established that any alteration
of normal synaptic function is related to such morphological changes. In fact,
there are already data that suggest that short-term synaptic decrement need not 
necessarily correlate with presynaptic morphological changes (66). 


Long-term changes Sotelo & Palay have reported ultrastructural evidence of 
terminal degeneration in normal brains, and they suggest that constant death and 
regrowth of terminals may be a normal state of affairs (114, see also 26). Further- 
more, they have seen ultrastructural evidence indicating active growth of den- 
drites, and have hypothesized that the postsynaptic as well as presynaptic elements 
may be involved in a constant renewal process (113). Although there are no direct 
physiological data demonstrating support of this hypothesis, several morphologi- 
cal studies have found long-term changes of both presynaptic and postsynaptic 
structures. Early studies on the visual system reported alterations of the size and 
number of synapses or of the branching patterns of dendrites following damage to 
afferent fiber systems or following long periods of sustained exposure to abnor- 
mally low levels of afferent input (28, 30, 121). Fifkova (39, 40) reported mor- 
phological changes of synapses in the visual cortex of rats raised with one eye 
sutured closed. Since diffuse light can still penetrate through the sutured eyelid, 
the morphological changes may be due to a lack of pattern vision rather than lack 
of light stimulation per se. These results, as well as others (30), indicate that the
nature of morphological changes induced by environmental conditions cannot be 
expressed by simple generalizations. For example, axodendritic synapses in the 
cortex receiving the projection from the sutured eye were decreased in number but 
increased in size. By contrast, axosomatic synapses decreased in size, particularly 
those synapses containing round as opposed to oval vesicles (39, 40). 

The visual system of the cat appears to be particularly sensitive to envi- 
ronmental contingencies up until about 4 months of age. The morphological, 
physiological, and behavioral effects of unilateral lid closure in cats are most 
readily produced in cats no older than 4 months, and during this same period, 
substantial reversal of the effects of unilateral closure can be produced by revers- 
ing the eye that is sutured closed (10). However, there is evidence that under 
appropriate conditions at least some degree of recovery of visual function can be 
produced in cats that have had unilateral lid closure from birth up to a year (24). 

Recently changes of synaptic morphology (39, 90, 126) or dendritic branching 
(47a, 110) have been correlated with more normal variations of environmental
stimulation. One line of research has demonstrated morphological correlates 
related to relatively mild variations in the degree of environmental complexity in 
which young animals are raised. Alterations of both synaptic (90, 126) and den- 
dritic (47a) structures have been reported. On the whole, these changes closely 
parallel the earlier as well as more recently reported changes of gross cortical 
morphology or biochemical characteristics related to environmental complexity
(106). Some of the cerebral effects of environmental complexity have been found 
in adult animals (104a), strengthening the possibility of relating these effects to 
learning rather than to development. 


CONCLUSION 


During the past 5 years there has been a great acceleration of cellular studies of 
mechanisms of learning and behavioral plasticity. As is usually the case in science, 
few recent studies have revealed dramatically new principles, but older isolated 
observations or hypotheses have received substantial confirmation or rebuttal. 
Particularly significant has been the movement toward relating model studies of 
short-term synaptic plasticity to short-term alterations of behavior. 

Alterations of synaptic efficacy have been definitively related to short-term 
alterations of behavior in invertebrates. The available data indicate that similar 
alterations underlie short-term behavioral changes in vertebrates. An important 
advance in the analysis of the neural mechanisms of learning will be to determine 
if the existing model systems of long-term neural plasticity are relevant to long- 
term behavioral modification. Most model studies of long-term neural plasticity 
have been done in developing nervous systems or in neural systems recovering 
from damage. It will be a challenge to demonstrate whether or not the existing
examples of the capacity of the nervous system to undergo long-term plastic 
changes have relevance to behavioral modifications, or whether such changes 
represent unique adaptations of growth or response to injury. An additional
challenge for future research will be to determine to what degree the neural 
mechanisms of nonassociative learning such as habituation apply to associative 
forms of learning. 

This year may mark the beginning of the major turnaround in personality study
we have long awaited. Not that “progress” is compellingly visible; flaws noted by 
previous reviewers continue: “sprawl and diversity” abide, as does that chilling 
absence of larger theoretical aims and synthesizing concerns which Adelson (1) 
deplored so eloquently. Yet there are grounds for predicting the rebirth of a solid 
personology, and the reviewer’s task must now be that of nurturing the promising 
new life. Six years ago Adelson construed the task as largely reportorial, and 
regretfully focused on mainstream research at the expense of studies that were 
“off beat or ahead of their time.” Here I have chosen the opposite approach; 
working within drastically reduced page limitations, I have neglected a good deal 
of competent mainstream research to focus upon developments which seem most 
promising for the future. This selectivity surely betrays my own bias; but the use of 
personal intellectual convictions is in itself one of the promising trends to be 
reported.

Most observers would agree that personology, in any nontrivial sense, virtually 
disappeared during the 1960s. To be sure, a number of investigators managed to 
maintain a steady concern with “real” personology, the elaboration of compara- 
tive theories of personality flourished, and the proliferation of psychometric ap- 
proaches continued unabated. However, as an area of inquiry, the field was largely 
swamped by the vigorous development of experimental social psychology, con- 
quests in cognitive-developmental work, the burgeoning technology of behavior 
modification, and the celebrations of humanistic ideology. That we now confront a
period in which personology seems likely to be rediscovered—when, if it did not 
exist, it would have to be invented—is owed more to successes in some related 
fields, and to failures in others, than to the coherent development of personology 
itself. Within general psychology, the ascendance of biological-evolutionary and 
cognitive approaches have been liberating forces, along with the collapse of the 
laboratory-experimental method as the major source of knowledge in social and 
developmental psychology. Beyond psychology, the pace of social change—especially
the impact of Black consciousness, women’s liberation, and the counterculture—has
produced searching reexaminations of our traditional assumptions.

Personality research is difficult to organize, even in the most tranquil times. One 
must deal with structural, dynamic, and developmental questions—and from the 
perspectives of general, differential, and individual psychology. The task is com- 
pounded during periods of major paradigm-shifts, since any contribution may 
simultaneously speak to quite different substantive, conceptual, and methodolog- 
ical issues. Here the presentation is organized on the basis of my intuitive 
judgment of “importance.” Conceptual and methodological issues now lie at the 
forefront of personology; if there is disappointingly little to report by way of 
“advances” in substantive research, the empirical literature may serve to mark 
problems of importance and possible ways of redirecting inquiry. 


REORIENTATIONS IN THEORY AND METHOD 


Far more significant than any specific advances during the past year is the distinct 
change in the climate of psychological research. Widespread dissatisfaction with 
dominant modes of inquiry—the reliance upon simple, linear, quantitative, causal 
models for prediction of overt behaviors—has ushered in a period in which a 
serious personology may flourish. The year has seen several proposals for funda- 
mental reorientations in conceptualization and inquiry. Spence (89) offered a 
provocative contrast between digital and analog descriptions of behavior, noting 
the sensitivity and flexibility of analog systems, and their particular relevance for 
fields of inquiry where the molar phenomena are more important than their 
molecular ingredients. 

Urging the need for a thorough conceptual exploration of personality descrip- 
tion, Alston (3, 4) contributed an extremely useful analysis of the distinction 
between T-concepts (essentially “trait” notions, defined by frequency of responses 
over a range of situations) and D-concepts (“desires” and other “deeper-lying” 
dispositions). T’s and D’s are shown to have opposite strengths and weaknesses in 
nomological relations to behavior; an analogy is the contrast between a language 
(D’s) and a signaling system that lacks a grammar (T’s). Importantly, Alston 
observed that: “... T’s and D’s do not fit together into a single coherent scheme. 
Hence in developing a model for personality description we must choose between 
them, and as far as a theory of basic underlying structure is concerned, the 
preference must be for D’s.” This welcome gift from a philosopher is worth the 
serious attention of personologists. 

Two papers by Olweus (71, 72) advance similar conceptual distinctions in 
reporting a program of research on aggression. Olweus argued (and demonstrated 
empirically) that it is necessary to distinguish between theoretical and observable 
variables, between action and inhibitory tendencies, between situational and 
habitual components of behavior. Noting that a simple, linear model proves 
inadequate, he warned that such powerful linear techniques as multiple regression 
and factor analysis may be of little value when not guided by theoretical considerations.
In a series of studies of aggression in boys, markedly different relationships
were obtained between projective and overt aggressive responses for
boys with different levels of inhibition, but both projective and inventory ap- 
proaches could predict aggression with considerable accuracy. There may exist a 
good deal of consistency on a genotypic level, despite absence of phenotypic 
consistency. Olweus’s findings strongly support his contention that “prediction is 
not enough, and cannot substitute for understanding and explanation” (71, p. 
315). 

Still another voice urging a compatible message is Livson’s (63) discussion of 
developmental dimensions of personality. The life-span developmental perspec- 
tive, he suggests, offers a necessary corrective to approaches which have confused 
substantive and methodological issues. Such an approach is necessarily concerned 
with continuities, seeks to identify genotypes which have eluded investigators 
preoccupied with phenotypic data, must necessarily establish typologies, rely 
upon life-record data (observations and interviews), and readmit clinical judg- 
ments and rating methods as indispensable tools of inquiry. 

While the message of current contributors strongly challenges the value of 
familiar trait conceptions and quantitative, linear measurement approaches, the 
issues are by no means resolved. Cattell (16) offered a comprehensive treatment of 
personality and mood as assessed with questionnaires. Although much of this 
volume is directed toward technical issues and internal conflicts among workers in 
the psychometric tradition, Cattell, as usual, has organized an extremely broad 
domain, providing a conceptual structure which includes the hierarchical organi- 
zation of dispositions, their course through the life history, manifestations in 
pathology, and much more. The volume deserves to have considerable impact 
beyond the immediate circle of questionnaire aficionados.


Person, Situation, Interaction 


Pseudo-issues manage to consume as much time and effort as real ones, and few 
controversies have seemed so noisy in recent years as the situationists’ attack on 
the legitimacy of “personality” variables in psychological science. The controversy 
seems to have dissipated during the period of this review, and the “disposition vs 
situation” issue resolved in mutual agreement that “interactional” frameworks are 
required. A good deal of rather naive and redundant argument has been put into 
perspective by Bowers’ (11) timely and useful analysis. Examining the metaphys- 
ical, psychological, and methodological biases of situationism, he noted the 
consequences of the misidentification of an S-R viewpoint with the experimental 
method, and of the nearly exclusive reliance upon laboratory experimentation as a 
mode of inquiry. His perceptive analysis of the recent literature should be useful 
to many workers. 

Reviewing the entire “disposition vs situation” controversy, it seems astonishing
that so much effort and intelligence have been expended in attacking or defending a
narrow “trait” conception of personality. Serious personologists have never con- 
strued behavioral consistencies as defining personality. Situationists, holding a 
somewhat different scientific ideology, have tended to misconstrue belief in personality
dispositions as somehow equivalent to “person blame.” Hopefully, we
have passed through an awkward age, and are now belatedly prepared to search 
earnestly for keys to understanding individuality. 

While the triumph of a Lewinian “interactionist” view comes as a welcome 
relief, it is not entirely clear how such a research program is to be theoretically 
guided. Neither of the two most vigorous research traditions with serious claims in 
this area—those identified with attribution theory or with locus of control—appears 
to hold great promise. Attribution theory, ably represented by Jones et al (51), 
among many others, tends to focus rather narrowly upon cognitive processes 
under situational constraints. The disregard of preexisting subject variables, and 
of more complex cognitive-affective processes, seems to limit what current attri- 
bution theory research may contribute to personology. Locus of control literature, 
although conceptually dealing with an important personality variable, has been so 
instrument-bound as to offer little theoretical clarity. Undoubtedly, this is the 
single most popular topic in current personality research, so one hesitates to offer a
cavalier and perhaps premature dismissal of this work. At this point, the internal-
external control variable appears to be one of the fads which periodically cap- 
tures the field. Collins and his associates (22, 23), placing the internal-external 
metaphor in a broader context, explored empirically alternative explanations of 
the I-E work. They concluded that at least four separate dimensions of individual 
difference are involved, and that the metaphor can be (and should be) decom- 
posed. Recent work with multidimensional assessment of I-E appears to generate 
firmer relationships with outside criteria. However, refinement of I-E assessment 
methods would not seem to transcend basic limitations. Until fundamental issues 
are addressed with greater theoretical depth and methodological breadth, current 
locus of control research seems unlikely to clarify critical questions about person- 
situation relationships. 


In Search of Personality Theory 


A most serious problem remains the absence of any widely shared, comprehen- 
sive, and dynamic theoretical framework capable of posing significant, research- 
able questions or tying together research findings. The resulting fragmentation 
in the field, with its unsatisfying array of researches guided by “middle-range 
theories,” scientific ideologies, or simply the availability of popular ideas and 
instruments, is troublesome enough; a further problem is our doubtful capacity to 
assimilate advances in neighboring fields which pose exciting challenges to the 
personologist. Implications of work in biology and ethology (e.g. right-left brain 
functions, evolutionary adaptedness of dreaming, of facial expression, and so on), 
in anthropology, psycholinguistics, and cognitive-developmental psychology have 
posed an implicit agenda for personology. Our most urgent task may well be that 
of the difficult, systematic, theoretical stock-taking we have avoided in recent 
years. 

Psychoanalytic theory, which served this guiding, integrating function through 
many years of personological inquiry, was just such a comprehensive and dynamic 
formulation. To an unacknowledged degree, psychoanalytic theory still guides 
research—if in an implicit, diffuse fashion—in the choice of problems (e.g. ego
development) or research variables (e.g. defenses). Despite considerable disillusionment
with aspects of Freudian theory (for example, the energy model and the
treatment of sex differences, which stands among its more dramatic “failures”), 
the sheer scope and depth of the theory—and its viability in contemporary ego 
psychology—remain invaluable to personologists. Jahoda’s (49) thoughtful anal- 
ysis of the mutual challenges of social psychology and psychoanalysis concluded 
that the controversial position of psychoanalysis must be kept alive. Stoller (90)
recently reviewed Freudian formulations of psychosexuality in the light of recent 
biological inquiry, noting both the revisions required and the stability of the basic 
structure. Kline’s (53) assessment of the status of psychoanalytic propositions 
yielded a rather positive verdict. This work, which stands as an updating of Sears’ 
assessment 30 years ago, will be a useful resource for many workers. More recently,
LeVine (59) explicitly invoked psychoanalytic theory and method in his
reformulation of the culture-and-personality field. He offered an exceedingly 
useful review of the earlier culture and personality work, a critique of personality 
assessment methods, along with a proposal for a “population psychology” in 
which the method of clinical psychoanalysis could be translated into a program for 
cross-cultural work. While this last feature may raise doubts among many 
personologists, LeVine’s analysis of contemporary personality research is percep- 
tive, and his ideas worth serious attention. 

The promise of a “new” theoretical framework of complexity and power may be 
found in the accelerated rediscovery of Jungian thought. The past year has seen 
the publication of the first volume of Jung’s letters (2), the Freud-Jung 
correspondence (66), and the appearance of several new books presenting Jung’s 
theory to various audiences (38, 78, 86, 91). The very bases for psychologists’ 
earlier neglect of Jung’s work now seem to commend it to our attention: the 
emphases upon nativism, symbolism, the intrinsic duality in human nature, and 
upon the proactive quality of inner experience are obviously compatible with the 
emerging Zeitgeist. Doubtless our students—and the ubiquitous “intelligent lay- 
man”—are mainly responsible for the Jungian revival. Jung’s own writings, like 
those of his current interpreters, tend more toward metapsychological and thera- 
peutic issues than toward providing an explicit framework for empirical inquiry. 
Thus considerable effort may be required in aligning Jungian theoretical insights 
with appropriate research procedures. However, this task seems both feasible and 
worthwhile, as several recent contributions demonstrate. 

For a number of years, Helson’s researches into creative styles have invoked 
Jungian thought in delineating the personal qualities and imaginative processes 
involved in creative work. Two new papers (39, 40) provide excellent examples of 
the use of Jung’s theoretical ideas in organizing psychometric data to capture the 
subtle relationships between dimensions of literary fantasy and the personality 
and work style of the author. Among several provocative implications of Helson’s 
work are the possibilities of integrating Rankian and Jungian concepts, and the 
use of Jung’s typology—and concepts of “anima” and “animus”—in accounting for 
masculine and feminine styles of creative work. 
Jung’s theory of psychological types, while representing only a part of the entire
theoretical apparatus, may prove to be the aspect of Jungian thought most acces- 
sible to (and testable by) academic psychologists. For the mainstream psycholo- 
gist, Jung’s type theory plays a role comparable to that of Freudian “defense 
mechanisms” for earlier generations: the constructs offer a coherent descriptive 
system which accords well with (and helps to organize) the individual differences 
one encounters in real life, and may be adopted somewhat independently of a 
commitment to the entire theory. Unfortunately, the type theory seems to be 
widely misunderstood as mostly involving “introversion vs extraversion,” so that 
the dynamic complexity of the typology has been little explored. Carlson & Levy 
(14) proposed the type theory as a framework for studying person-situation in- 
teractions, and reported a series of empirical studies directed toward its construct 
validation. Using rather conventional research methods to test derivations from 
Jungian theory, they found that introverted-thinking and extraverted-feeling 
types (assessed via the Myers-Briggs Type Indicator) differed in performance on 
two types of short-term memory tasks; intuitive-perceptive types were more ac- 
curate than sensing-judging types in interpreting facial expressions of emotion; 
and extraverted-intuitives were overrepresented among social service volunteers. 

Research on Jungian typology may be greatly facilitated by the avail- 
ability of a standardized instrument—the Myers-Briggs Type Indicator (69)—which 
combines intuitive appeal to subjects with psychometric stability (60) and an 
increasingly firm record for construct validity. Hopefully, publications from an 
ongoing program of typological research at the University of Florida (M. 
McCaulley, personal communication, December 1973) and from intensive work 
by the test developer (I. Myers, personal communication, December 1973) may be 
available to future reviewers for the Annual Review of Psychology.

What of other theories of personality traditionally discussed in our textbooks? 
Adler’s influence was noted in a new commemorative volume (68) and in the 
development of a measure of the social-interest variable (36). McDougall’s sem- 
inal ideas are presented in Boden’s (10) scholarly discussion of purposive ex- 
planation in psychology. Humanistic and social learning viewpoints motivated 
several papers reviewed in later sections. However, one discovers that there are 
rather few formulations of sufficient breadth, depth, and complexity to qualify 
as “theories of personality.” Alternative viewpoints seem to represent different 
emphases, or to be largely identified with particular variables, particular methods. 
The complementary, rather than competing, nature of alternative formulations is 
a core notion in Cartwright’s (15) new textbook which develops “realistic eclec- 
ticism” as a framework for teaching the basic personality course. 

Meanwhile, exploration of metatheoretical issues promises to enliven the liter- 
ature and should ultimately contribute to conceptual clarity and methodological 
advance. Prominent are recent discussions of dialectic (81), positivist vs organicist 
(75), and relational (64) models, along with such unpublished work as Tomkins’ 
(S. S. Tomkins, personal communication, April 1974) formulation of “power vs 
purity” in models of inquiry. 


A SELECTIVE REVIEW OF PERSONALITY RESEARCH


Ideally, it should be possible to organize a year’s output of personality research in 
terms of a coherent conceptual framework. This has not been possible, and the 
research literature is here grouped under somewhat arbitrary and overlapping 
thematic labels. Regrettably, several important topics are entirely neglected and 
others treated incompletely—largely from space limitations which required the 
exercise of the reviewer’s (clearly fallible) judgment as to the “promise” of current 
work. 


Ego Development, Moral Development 


Personologists’ concern with truly molar levels of analysis is reflected in continu- 
ing work on ego development and moral development. Haan et al (37) attempted 
to assess the interrelationships among three formulations of development and 
process (Loevinger’s model of ego development, Kohlberg’s moral development
sequence, and Haan’s model of ego functions) in a study of hippies. While the
sample was too small and too special to permit a thorough exploration of theoretical
issues, there emerged a useful picture of the dynamics of hippie youth.
Coping ego functions were related to level of moral development, but not to ego 
development; if anything, ego development appeared to be related to increasingly 
“successful” defense. Hippie youth appeared to be in a transitional stage beyond 
convention, but not yet principled in moral reasoning or conscientious in ego 
development. The investigators offered new ideas about the “fit” of normative 
theories to persons who have chosen a socially dissident, morally uncertain, and 
personally hazardous life style. 

Erikson’s formulation of psychosocial crises was explored in a study (73) which 
described a new ego identity status—“alienated achievement”—and modes of 
coping with the intimacy vs isolation crisis among college men. One in “alienated 
achievement” status seems to equal the identity-achievement person in ego 
strength and degree of identity resolution, but bases his identity more on inter- 
personal relationships than on matters of occupational or ideological choice. 
Three major “intimacy” statuses were identified and related to identity statuses. 
Tentative support emerged for the basic premise that genuine intimacy occurs 
only after a reasonable sense of identity has been established. In another study 
(93), college women of different ego identity statuses were presented the Asch 
conformity task. Those in “stable” identity statuses (achievement and foreclosure) 
conformed less than those in “unstable” statuses (moratorium and diffusion). 
Erikson’s conception of basic trust as the outcome of successful resolution of the 
earliest psychosocial crisis motivated a study of trust and academic achievement 
(47). Girls were generally more trusting; the child’s trust of the teacher and the 
teacher’s ratings of trust were independent of intelligence and significantly related 
to academic performance of school children. 

A rapprochement with Erikson’s theory appears in Kohlberg’s (55) recent essay 
on moral development. Kohlberg has dramatically shifted from his earlier contention
that moral development ended with adolescence; stages of principled
moral reasoning are now seen as appearing first in early adulthood. Further, there 
is sketched a hypothetical seventh stage of moral development corresponding to 
the basically religious Eriksonian stage whose outcomes are a sense of integrity vs 
a sense of despair. Kohlberg’s discussion of theoretical issues involved in 
integrating structural stages and ego development stages poses many stimulating 
ideas. In the structural model, stages are different structures for a single function 
(e.g. moral development, logical reasoning); in Eriksonian theory, stages represent 
choices or uses of new functions by an ego. In structural theory, developmental 
change is primarily a changed perception; later stages are more cognitively ade- 
quate and more universally applicable. In contrast, Erikson’s theory involves 
developmental change as a self-chosen identification with goals, resulting in rela- 
tively permanent choices or commitments; later stages are more adequate, not in 
cognitive inclusiveness, but in their ability to order personal experience in more 
stable, positive, and purposive form. The basic differences are those of logical vs 
psychological formulations; as Kohlberg notes, an integrated theory of social and 
moral stages would attempt to combine the two perspectives. 

An alternative model of moral conduct and moral character has been intro- 
duced by Hogan (42), who assumes that moral conduct is fundamentally “irra- 
tional” and that differences in even such clearly cognitive phenomena as moral 
judgments derive from more basic personological structures. Five dimensions are 
proposed: moral knowledge, socialization, empathy, autonomy, and a dimension 
of moral judgment (ethics of conscience vs ethics of responsibility). He offers 
conceptual and empirical definitions, developmental antecedents, and empirical 
findings supporting each of the five dimensions. Elsewhere (41, 43) Hogan has 
placed the problem of moral development in a dialectical framework and outlined 
a role-theoretical perspective to the problem of compliance. 

Fishkin et al (33) examined the relationships between moral reasoning and 
political ideology in an attempt to replicate and extend findings from earlier 
Berkeley studies of student activists. College students on eight campuses, studied 
during the Cambodia-Kent State crisis, responded to the Kohlberg moral dilem- 
mas and rated political slogans. “Law-and-order” moral reasoning was strongly 
related to conservative ideology; preconventional subjects favored violent radi- 
calism, while postconventionalists rejected conservative views but did not accept 
radical ideology. 

Moral development inevitably involves consideration of altruism and prosocial 
behavior—a topic which continues to command much research interest. The past 
year produced a “great” study which restores one’s faith in the experimental 
method by the competence and care with which basic issues were considered in the 
conduct of research and interpretation of findings. This is a study by Yarrow and 
her associates (102) of preschool children’s “learning concern for others.” Exper- 
imental treatments were based on analyses of natural socialization experiences in 
which parents may simply teach principles or may also exemplify them in practice, 
and where either procedure may vary in the warmth and nurturance of relation- 
ships. In constructing four experimental environments, several weeks were taken 
to establish the experimenters as meaningful adults in the children’s natural
(nursery school) settings. Various modes of presenting distress situations were 
developed; baseline tests of helping behavior preceded exposure to experimental 
histories of high or low nurturance and were followed by posthistory evaluation
before introduction of training conditions; both immediate and long-range tests 
for retention of experimental influences were made; and basic findings were 
replicated in a second experiment with children from a different socioeconomic 
background. (Surprisingly, one flaw in this otherwise exemplary study was the 
failure to report sex-composition of the sample or analyses for sex differences.) 
Findings were complex, but unequivocal: symbolic altruism increased in all ex- 
perimental groups, but was unaffected by adults’ nurturance; nurturance in- 
creased altruism in nonpressured, realistic encounters with distress, and children 
with nurturant caretakers who had modeled helping in both symbolic and live 
distress gave more help, verbalized more sympathy, and were more consistent in 
their altruism. 

The elegance and clarity of Yarrow’s design and results are matched by the rich 
and insightful discussion of theoretical and methodological issues in altruism 
research, and useful extrapolations to “real life” settings. This is a landmark study 
which sets standards for future work in the area. 


Masculine, Feminine 


Nowhere is the impact of social change more striking than in psychologists’ 
increased concern with sex differences in personality. Psychosexuality, although a 
central construct in personology, has been ignored in much research, or treated 
simply in terms of normative conceptions of “sex roles.” The consciousness-raising 
among psychologists induced by Women’s Liberation has produced a torrent of 
critical, substantive, and sometimes polemical work. The year’s harvest may be 
captured under two rubrics: the “psychology of women” (treated in a separate 
chapter of this volume), and conceptual and methodological issues in personality 
research arising in the study of psychosexuality. 

Implicit “masculine bias” in the structure of psychological inquiry has been 
noted with increasing frequency over the past few years. Carlson (13) addressed 
the impoverished conceptions of personality reflected in recent empirical work, 
and suggested that dominant modes of inquiry are “agentic” (masculine) in 
Bakan’s (5) sense; the complementary development of “communal” (feminine) 
styles of research is required for investigation of important problems. Among
deficiencies noted in current work, three interlocking issues were proposed for 
future personality research: duality in human nature (the coexistence of “op- 
posite” tendencies within the individual), typology (attention to qualitative 
constructs as a corrective to dimensional thinking), and biological bases of individuality.

The significance of a dualistic approach to masculinity-femininity was under- 
scored by Constantinople’s (24) review of M-F assessment literature. Her close 
scrutiny of major tests of M-F sharply questioned three basic untested assump- 
tions: (a) that M-F is best defined in terms of sex differences in item responses; (b) 
that it is a single bipolar dimension ranging from extreme masculinity to extreme
femininity; and (c) that it is unidimensional in nature and can be adequately 
measured by a single score. Some recognition of the “bisexuality” assumed in 
psychodynamic theories (e.g. Freud, Jung) now seems equally demanded by 
psychometric and theoretical imperatives. 

The power of a theoretical framework, when wedded to a broad-gauged meth- 
odology, is nicely illustrated by Jeanne Block’s (8) elegant summary of longi- 
tudinal and cross-national studies of sex-role socialization. Invoking Bakan’s 
conceptions of agency and communion and Loevinger’s formulation of ego devel- 
opment, Block succeeded in charting the course of development under varying 
personal, family, and societal conditions. Among other important findings, her 
work reveals: striking cross-national consistency in demands for agentic qualities 
in boys and communal qualities in girls (but with predictable overemphasis on 
“agency” in both sexes in the more capitalistic American society); a corrective 
emphasis upon the father’s role in psychosexual development for both sexes; and a 
clear tendency for more mature levels of ego development to involve the integra- 
tion of agentic and communal qualities. 

This line of inquiry was extended in another study (9) exploring data from the 
California longitudinal studies. Using Socialization and Femininity scores from the 
California Psychological Inventory as the basis of a fourfold typology (high-low
socialization, appropriate-inappropriate sex typing), Block et al examined personality
data from adult years and earlier family and childhood data in discovering
rather different “recipes” for sex-role socialization. A differentiated concep- 
tion of identification emerged, along with convincing evidence that traditional 
sex-role typing for men appears to extend the personal options available, while for 
women this has tended to restrict alternatives of action and expression. Among the 
emergent patterns, that of an “androgynous” identification—in which neither 
parent presented a stereotypic sex-role model, but both provided models of com- 
petence, tolerance, and shared responsibilities—appears to define a developmental 
course both salutary in its consequences and in tune with current social trends.

Psychological formulations of sex-role socialization (as of most important 
issues) may inevitably carry normative implications. If “appropriate” masculine 
or feminine sex-typing was the message urged in recent decades, it is rapidly being 
replaced by an “androgynous” norm which asserts that both masculine and fem- 
inine modes of experience and action are involved in optimal development. Bem 
(6, 7) has developed an adjective checklist distinguishing masculine, feminine, and 
androgynous sex typing, and has offered early evidence on its construct validity. 
Her work with collegiate samples indicates that these options can be assessed 
reliably, and that (with a few wrinkles) androgynous subjects of both sexes are 
more competent in performing experimental tasks associated with both stereo- 
typic masculine and feminine roles. The notion of psychological androgyny has 
become increasingly influential since Rossi’s (80) early paper; thus it deserves a 
more searching conceptual analysis than the current literature affords. To what 
extent are androgynous subjects more task-oriented than ego-oriented in a 
Lewinian sense? Is psychological androgyny more usefully construed as a 
“unisex” escape from normative constraints, or as the development of (bisexual)
potentialities within a secure gender identity? Are there risks that an androgynous 
norm may prove as constrictive as earlier norms of “appropriate” sex typing (e.g. 
Bem’s characterization of feminine women who “flunked” experimental tasks as 
showing “behavioral deficits”)? Might more complex dualistic formulations (e.g. 
Bakan’s constructs of agency and communion, Jung’s conceptions of anima and 
animus) enrich inquiry on psychological androgyny? Such questions, of course, 
must be answered by future research. 

Socialization is but part of the pattern of psychosexuality; there remains the task 
of integrating new biological knowledge with current psychological formulations. 
An important work by Money & Ehrhardt (67), which lives up to its subtitle, 
Differentiation and Dimorphism of Gender Identity from Conception to Matur- 
ity, provides an extremely useful integration of concepts and data from genetics, 
embryology, endocrinology, neurosurgery, social anthropology, and psychology. 
The publication of the American Psychopathological Association symposium on 
contemporary sexual behavior (105) offers several valuable formulations of bio- 
logical aspects of sexuality and their implications for psychological inquiry and 
therapeutic work. 


The Biology of Personality 


In an era when the biological-evolutionary viewpoint amounts to a “revolution” 
(27) in general psychology, we may expect that personologists will increasingly 
attend to psychobiological work for the enrichment and extension of our inquiry. 
Within the past year, the most prominent lines of influence involve the assimilation
or accommodation of work in genetics, psychophysiology, and morphology; on
the personality side, these deal with aggression, temperament, extraversion, and
obesity.

Jarvik et al (50) reviewed the evidence bearing upon the rare but theoretically 
important XYY genotype as a key to understanding human aggression. While 
many unresolved issues were noted, the authors suggested that the XYY genotype 
may be seen as highlighting the association between maleness and violence, and 
that the power of this approach to aggression lies in its basis in an inherent 
predisposition, along with the search for a solution in an understanding of envi- 
ronmental variables. Inheritance of temperaments was examined in a study (12) of 
127 pairs of monozygotic and dizygotic twins (aged 4 months to 16 years), with 
questionnaire assessments of emotionality, activity, sociability, and impulsivity 
provided by members of the Mothers of Twins Clubs in eight states. The findings 
argued strongly for a genetic component in the four temperaments, with a single 
exception (impulsivity in girls); heritabilities were generally higher for boys than 
for girls, and the impact of the environment was also strongly suggested by the 
patterning and age trends in the correlations. 

A Pavlovian view of “nervous typology” appears in both British and American 
work. Claridge and his associates (17) reported an ambitious twin study aimed at 
examining both psychophysiological and genetic bases of individual differences, 
and testing a theoretical model developed originally with psychiatric patients. 
Personality questionnaires, cognitive tasks, and physiological measures were administered
to a group of 95 pairs of normal adult twins; correlational and factor
analytic methods examined issues dictated by the underlying theory. While the 
findings were too complex for summary here, several implications of the results 
may be noted. “Tonic arousal” and “arousal modulation” were identified as 
components of psychophysiological patterns which could be used as a basis for 
nervous typological classification and meaningfully related to dimensions of 
neuroticism and psychoticism. Arousal modulation appeared to be strongly influenced
by heredity, and tonic arousal apparently subject to considerable environmental
variation. With respect to more familiar personality variables, the
findings were complex and perhaps disappointing to the investigators: genetic 
bases of extraversion appear mainly via the “sociability” component, and there 
was no evidence for any marked genetic contribution to anxiety/neuroticism; 
genetic contributions to cognitive variables appeared mainly via general intelli- 
gence. The thrust of the work, however, supported the underlying conceptual 
approach: that differences in the interaction between the same set of nervous 
typological processes are crucial, and that the problem is one of discovering the 
varieties of nervous typological organization rather than of specifying the partic- 
ular causal process which is separately responsible for each personality dimen- 
sion. 

“Strength of nervous system” in Pavlov’s sense was linked to “need for stimu- 
lation” in two studies by Sales et al (84). Strong subjects (“reducers”) presumably 
need higher levels of stimulation, while weak subjects (“augmenters”) perform better
at lower levels of stimulus input. Substituting a measure of auditory threshold
for the usual kinesthetic aftereffects task, the investigators confirmed predic- 
tions over a wide range of dependent variables. Strong (high threshold) subjects 
were more likely to prefer complex stimuli, become bored with simple tasks, drink 
coffee, have been raised in urban environments, and group social stimulus figures 
more densely, as compared with weak subjects. Somewhat compatible findings 
come from another study (52) relating personality variables to tolerance to an 
irritant compound. 

While in no sense biologically based, Coan’s (18) thorough exploration of 
personality factors in smoking is of some relevance here. He noted that smokers 
“... tended to show more tolerance, if not actual hunger, for varied ideas, emo- 
tional and perceptual effects, complexity, and perhaps even confusion”; smokers 
tended to be more extraverted than nonsmokers, although this rule applied more 
markedly to women, and the links between smoking and extraversion seem to 
require more thorough analysis of the nature of extraversion. 

The possibilities, as well as the problems, in psychophysiological approaches 
to personality are set forth in Gale’s (34) review of the extraversion-and-EEG 
literature. Psychophysiology (which he sharply distinguishes from physiolog- 
ical psychology) considers simultaneously behavior, subjective experience, and 
physiological state, and thus examines the “whole man.” Nearly four decades of 
research on extraversion-EEG relationships, however, have yielded contradictory 
findings, largely through failures to consider the conceptual and practical demands 
of the three component fields. Gale outlined necessary precautions to be taken in
future work, along with some alternative paradigms (those associated with vigi- 
lance, social interaction, stimulus complexity, and imagery) which may be devel- 
oped to yield useful data. Particularly important—as is clear in his analysis of 
existing work—will be greater concern with conceptual clarity and attention to 
subjects’ phenomenology. 

Clearly, there are potentially important relationships between much of the 
biologically oriented work on “temperament” and the cognitively oriented work 
on “style.” As yet these relationships cannot be described with much assurance, 
but it seems safe to predict lively attention to this problem area in the near future. 

From social and phenomenological standpoints, few personality data are more 
“biological” than body build. Morphology appears in the current literature mainly 
through the continuing interest in obesity. Here current concerns are with ex- 
plaining behavioral differences between obese and normal persons in terms of 
responsiveness to external cues (76, 77) or in terms of deficit in response inhibition 
(88). 

Evolutionary and ethological approaches continue to gain influence in psychol- 
ogists’ thinking. The impact may be clearest in the growth of work on nonverbal 
communication—an area nicely presented in a new anthology by Weitz (97)—and 
more specifically in studies of eye contact and of emotional expression. For 
example, Ellsworth & Carlsmith (32) specifically drew upon work in ethology in 
formulating the role of gaze aversion in aggressive encounters. (Their very com- 
plex findings from a complex research design are inevitably more provocative than 
definitive. The basic finding that human subjects try to eliminate or avoid aversive 
eye contact of a victim reflects both ethological and social psychological interpre- 
tation.) 

The centennial anniversary of Darwin’s basic work on emotional expression 
prompted an important volume (30) tracing the history of this field and presenting 
conceptual and methodological issues from several perspectives. Ekman’s intro- 
ductory chapter offers a useful case study of the bases for the rejection and later 
rediscovery of Darwin’s work; other chapters present the evolution of facial 
musculature and its role in emotional expression, an account of recent primate 
work, the history and unfinished business of developmental inquiry on emotional 
expression, an analysis of cultural universals and cultural constraints, and a dis- 
cussion of theoretical and methodological issues in Darwin’s work. 

Implications of the biological perspective for personology go far beyond the 
brief summary possible here. If the importance of biological thinking is not as yet
matched by clear indications of how we may best exploit advances in these 
neighboring areas, this may prove to be a blessing in at least two ways: toward the 
enrichment of personology and its contribution to other fields. Personologists have 
every reason to welcome and to understand the “biological revolution”—which, 
after all, affirms the importance of organismic variables—and to formulate ser- 
iously and critically our own biological assumptions. At this point, well-formu- 
lated ideas and disciplined speculation may be more important than data or tight 
designs. (For example, might not the “opponent process” model of neural organization
(45) suggest an approach to the study of duality in human nature?) Further,
several areas of psychobiology appear to welcome clearer definition of psycholog- 
ical variables which could better focus their inquiry. [For example, what might 
personologists contribute toward identifying those genotypes useful in shifting 
quantitative genetics toward a true behavior genetics (96)?] While it is far easier to 
pose questions than to suggest answers, the biological field will clearly com- 
mand—and deserve—sharply increased attention. 


Personality Style 


“Style” offers a potential key for capturing individuality in a dynamic and truly 
personal way, as Allport long held. Yet it is difficult to arrive at a conceptualization 
which clearly differentiates stylistic variables from abilities, defenses, tempera- 
ment, and a good many other constructs; viewed at close range, these constructs 
interact and interpenetrate in disconcerting ways. Indeed, perhaps style is that 
interaction. The notion of style as “how” one acts, thinks, feels—as contrasted with 
“what” one does, thinks, or feels—must serve. Several studies included in this 
section, however, might equally claim attention elsewhere. 

One of the most promising efforts to capture personal style comes from Rice’s 
(79) continuing study of voice qualities. Vocal style has intuitive appeal as a 
potentially important variable: it is a directly behavioral and “unobtrusive” 
measure, elicited in natural settings, and one which has been reliably assessed and 
related to therapeutic outcomes. In contrast to recent concerns with microscopic 
linguistic analysis of nonverbal communication, Rice classified a limited number 
of vocal styles on the basis of a “melodic line”—pitch range, tempo, emphasis 
patterns, as well as (reliably assessed) judgments of inwardness, instrumental 
quality, involvement, and the like. Three such vocal styles, previously identified in
a study of clients in Rogerian therapy, were related to pretherapy Rorschach and 
MMPI data. A “focused” style (high energy, narrow pitch fluctuation, irregular- 
ities of tempo, a quality of inward-turning), an “externalizing” style (high energy, 
wide pitch range, regular stress patterns, a “talking-at” quality), and a “limited” 
style (low energy, narrow pitch range, even tempo, weak stress patterns, and a 
fragile, distancing quality) were related in predicted ways to Rorschach scores 
associated with creative functioning; as predicted, they were unrelated to Ror- 
schach scores reflecting normative-adjustment criteria or to MMPI scores. 

This line of inquiry was extended in Wexler’s (99) study relating voice quality to 
cognitive processes (differentiation and integration; vividness and variety) as- 
sumed to reflect ongoing processes of self-actualization, as well as to a more 
conventional measure (the Personal Orientation Inventory) of self-actualization. 
Subjects were undergraduates describing feelings when experiencing various 
emotions. Again, a “focused” vocal style was strongly associated with both the 
meaning-structure and questionnaire measures of self-actualization; “externaliz- 
ing” vocal style was negatively related to both types of measures, and a “limited” 
style showed weak negative relationships to both. This approach to assessment of 
vocal style seems promising for other research contexts, as well. 

Expressive movement remains a classic but surprisingly undeveloped field for 
studying personal style. A series of neat little studies by Zweigenhaft & Marlowe
(104) clearly established links between signature size and various indices of status 
and self-esteem. Taken together with Zweigenhaft’s (103) earlier findings, these 
studies point to the promise of work on graphic expression, and invite more 
searching looks at theoretical implications. 

The explosion of work on nonverbal communication gives new impetus to the 
study of expressive styles—particularly work on eye contact and emotional ex- 
pressiveness. While most of the current interest in eye contact is concerned with 
communication processes, the eyes also provide cues to deeper, more permanent 
aspects of personality. Libby & Yaklevich (61) studied the effects of three personality
needs on maintenance of eye contact and gaze aversion during structured
interviews. As predicted, nurturant subjects returned the interviewer’s gaze; subjects
high in abasement looked leftward (not downward) when shame and humiliation
were invoked; intraception was unrelated to ocular responses; and women
maintained eye contact more often than men. The complex findings, discussed in 
terms of theories of Ekman and Tomkins as well as findings from ethological and 
right-left hemisphere research, offer provocative leads for further inquiry. Con- 
scious attempts at duplicating emotional expressions, however, are not strictly 
“expressive” and involve other considerations, as Draughan’s (29) study reveals. 
Most subjects improved when a mirror was used; however, anxious people were 
inhibited by the use of a mirror, and nonanxious people were not. 

Interest in personality style remains centered in cognitive styles—and the ana- 
lytic vs global style (field dependence-independence) remains the hardiest of our 
stylistic variables. Yet serious questions are raised: about the implicitly evaluative 
nature of the construct (with field independence defining the “desirable” style in 
most studies), about implicit sex bias (with males typically more field-indepen- 
dent), and about the generality of field dependence-independence (since the 
defining tasks involve spatial abilities). Several contributions this year have ad- 
dressed these questions. DeFazio (28) examined the analytic-global style in a 
linguistic context and found that field independents performed better on the Cloze 
Test and speech-shadowing tasks. Thus the analytic style seems to represent more 
than spatial ability. Within his carefully drawn (and thus somewhat special) 
sample, field-independence was associated with language competence, but sex was 
not. Other studies found field independence related to empathy (65) and enhanced 
by children’s practice in meditation (62). Nevill (70) studied the effects of depen- 
dency arousal upon both eye contact and field dependence-independence. Among 
her findings were the following: dependency arousal increased duration (but not 
frequency) of eye contact, as well as field-dependent responding; field dependence 
was consistently related to duration of eye contact; women were more often field 
dependent, and tended to maintain eye contact. 

Sex differences in two cognitive-perceptual styles—differentiation and stimulus 
intensity control—were studied by Silverman et al (85). This work introduced 
several new features: assessment of other dispositional variables, the use of an 
EEG measure of perceptual differentiation, and replication with a clinical sample. 
The results point to significant sex differences and pose new questions. For example,
among males high anxiety and neuroticism were associated with undifferentiated
perceptual responsiveness and with stimulus intensity augmentation;
among females, these were associated with differentiation and with stimulus 
intensity reduction. Sex-typed results with “normal” samples were reversed with 
clinical subjects (mainly acting-out adolescents). Further, EEG-evoked-response 
measures of perceptual differentiation (offered as a promising way of studying 
basic phenomena among infants, psychotics, subjects in drugged states) point to 
the role of neurophysiological functioning in perceptual style which demands 
further inquiry. 

An information-processing approach to cognitive differences between repres- 
sors and sensitizers is well illustrated by Pagano’s (74) use of the Sternberg choice 
reaction-time memory-search task. Taken as a whole, his findings imply that 
cognitive style differences between repressors and sensitizers are not to be found at 
the level of “taking in” external stimuli; rather, events occurring at the level of 
memory organization and internal coding of experience are implicated. Other 
links between new work in cognition and personologists’ concern with personal 
style seem clearly worth developing.


Emotions and Personality 


Emotions hold a secure and central place in any account of personality—yet it 
remains difficult to define that place. With the growing discontent that psychology 
has become “too cognitive,” neglecting the affective side of experience and behav- 
ior, considerable recent work has sought to look more closely at cognitive-affec- 
tive dynamics. Efforts are still handicapped by the lack of any widely shared 
theory or well defined concepts of emotion. Most personality theories, as Izard (48) 
noted, have tended to ignore the problem of delineating the separate emotions and 
their effects on cognitive and motor processes. Even general concepts of emotion 
are discussed in confusing or contradictory terms; thus, for example, “feeling” 
means quite different things to Rogers and to Jung. A sampling of this year’s 
contributions reveals the scattered quality of the growing body of work on emo- 
tions, and invites further efforts at clarification and integration. 

Strongman (92) offered a broad survey of approaches to the psychology of 
emotion, drawing together, comparing, and evaluating representative views. 
While this is a useful volume, it does not fully address personologists’ concerns, 
and in its stress upon methodological and behavioral criteria it seems unlikely to 
replace Izard’s (48) recent survey. Ekman’s (31) contribution to the Darwin centennial
volume closely examined the “culture-specific” vs “universal” views of
emotional expression, noted the problems in conducting cross-cultural inquiry, 
and weighed the evidence on both sides of the controversy. This extensive, 
in-depth review (which includes a valuable summary of work by the Ekman and 
Izard research groups) substantiates a clear conclusion: there are universal facial 
expressions of emotion. In attempting to explain the basis for the demonstrated 
pan-cultural facial expressions of emotion, Ekman (following Tomkins) offers a 
neurocultural theory postulating innate facial affect programs along with socially 
learned and culturally variable elicitors of the affect programs.

While the intricacies of emotion theories are beyond the scope of this review, it
appears that neo-Jamesian theories are now in ascendance; among these, 
Schachter’s arousal-plus-cognition formulation seems the most influential. In this 
context, recent work on facial expressions suggests the need to give greater re- 
cognition to discrete emotions and to the role of feedback from facial and postural 
(rather than merely visceral) responses in accounting for the experience of emo- 
tion. 

Laird (58) found that subjects led to “frown” or to “smile” without awareness of 
the nature of their expressions reported feelings of anger and happiness, respec- 
tively, and found cartoons funnier when smiling. This demonstration that facial 
expressions can induce appropriate emotional expressions bears upon theoretical 
questions. While the study is presented in terms of self-attribution theory, Laird 
noted that the results are equally compatible with any of the neo-Jamesian for- 
mulations; the preference for an attribution model rests mainly on the salience of 
cognitive variables in Schachter’s studies, and on continuity with attributional 
processes in social psychology (as contrasted with the “sensory” emphasis seen in 
alternative models). However, several features of the study (including the primary 
concern with facial expressions) suggest that its contribution might more readily 
be assimilated to the Tomkins-Izard formulations. 

Openness and depth in the experiencing of feelings are central in several 
versions of humanistic psychology. Wexler (98) addressed this issue, proposing 
that “depth of experiencing” reflects the amount of cognitive activity expended in 
elaborating and synthesizing facets of meaning with respect to the emotional state. 
Students described their feelings while experiencing three different emotions and 
subsequently rated their “involvement” in each; independent judges reliably rated 
depth of experiencing, and the tape-recorded narratives were content-analyzed for 
differentiation and integration of meaning. As predicted, self-report and external 
judgments of “depth” were related to the cognitive measures; further, positive 
affects were both more involving to the subjects and produced more differentiated 
and integrated meanings. The findings seem to go beyond the theoretical 
frameworks invoked (humanistic psychology, Schachter’s theory of emotions) to 
require a more complete account of the processes involved. Again, Tomkins’ 
theory would seem more apposite in its account of the recruitment of thought and 
imagery in the notion of “ideo-affective density.” 

For the personologist, the area of cognitive-affective dynamics poses some of 
the most exciting and formidable challenges to be found in contemporary 
psychology. This is an extremely broad domain, incompletely mapped by several 
tenuous, complex, and perhaps contradictory formulations. Only a few new ex- 
plorations can be noted here. 

Radical reformulations of traditional approaches to learning and concept for- 
mation are implicit in work on affects. Unpublished work by Levy (N. Levy, 
personal communication, March 1974) points out that most analyses of emotion 
have failed to note that emotions are ideas, and thus can be studied as concepts. A 
compatible formulation has been advanced in Rychlak’s (82) program of research 
on “reinforcement value” in learning. He observed:

The term ‘conceptual’ is used widely in psychological theory, usually in inappropriate
fashion. To conceptualize is to actively create ideas ... The essential nature of affect 
(feelings, emotions, and so forth) is that it all comes down to judgments... the student 
of affective factors wishes to raise his dimension to the status of a basic dimension in 
human learning. The conceptual and nonmotivational side to affective influences on 
learning is that they are... acts of evaluation... This evaluative capacity is far more 
central to human learning than laboratory-centered theories of learning using lower 
animals and IBM machines as analogues have made it appear (82, pp. 253-54). 


This basic conceptualization has been developed in a series of researches on 
learning styles which have major implications for the assessment of ability, racial 
differences in intellectual styles, and more fundamentally, for understanding 
personological bases of learning. Importantly, Rychlak also seems to provide an 
independent formulation of the “feeling function” of Jungian theory, and thus 
offers another potential bridge between disparate lines of inquiry. 

The extremely rich formulation of cognitive-affective dynamics available in 
Tomkins’ (94, 95) theory has not as yet been sufficiently exploited in empirical 
work. Within the year, Singer’s (87) program of research on play and imagination 
invoked the theory as a conceptual framework, and a series of studies of an 
affect-based typology of smoking (46) added to its construct validity. However, 
more central aspects of the theory deserve translation into testable formulations 
and increased attention in other research contexts. 


Neglected Themes: An Apology 


With dismay I recognize how much has been left out of this chapter, and offer a 
deep apology to readers whose major research interests have been totally ignored. 
One might properly expect to find discussion of new work on dreams (20, 21, 26, 
100), fantasy and imagination (87), on ego strength and ego defense (35), on 
anxiety and aggression, the formulation and functioning of the self-concept, on 
creativity, and much more. Neglected, too, are examples of methodological alter- 
natives available to personologists (19, 25, 83) and critical discussions of more 
familiar methods (54, 101). Since “personality” includes the whole human being 
—and thus most of what psychologists study—one must be severely selective. 
Selection based upon the ripeness of problem areas for contributing to theoretical 
development of the field carries the enormous price of neglecting many valuable 
contributions. 


SOME FINAL WORDS 


The optimism of the opening sentence may seem more a wish than a prediction; a 
review of the field shows more possibilities than performance. But the message is 
still optimistic. Psychology seems to have entered a liberal and humane intellec- 
tual climate, appreciating the complexity of the human being, and prepared to 
reclaim a neglected heritage of ideas. We have “caught up” with James, Jung, and 
McDougall and can begin to build upon their ideas. Personologists have an 
important task: to capture the organization of psychological processes within
persons-in-societies. This surely requires a return to theory, but not in the sense of
“comparing theories” as objects of study. Our efforts are better directed toward the 
conceptual clarification and integration that will enable us to pull together a vast 
empirical literature and better read its meaning. 

Personologists have always realized that we must look in two directions—toward 
biology and toward culture—and attempt some synthesis. Perhaps we might also 
look inward, remembering that the enduring theories were written out of the 
experience of Freud, Jung, James, and others. On a more modest level, ideas can 
be tested against our own experience—as can our choices of problems and meth- 
ods. Here Hudson’s (44) autobiographical critique of psychology and Krawiec’s 
(56, 57) collection of psychologists’ autobiographies offer delightful and abundant 
evidence that “personality” is alive, well, and functioning.


Our original review uncovered over 1500 articles and books related in some way
to the study of attitudes and opinions. Of course, a much more modest number 
actually has been included in this review. The study of attitudes and opinions, 
broadly defined, has historically occupied much of the attention of social psycho- 
logy and underlies a good deal of research in other areas as well. To cut the field to 
a more manageable size, we have attended quite closely to the structure of the 
previous four chapters by Fishbein & Ajzen (92), Sears & Abeles (278), McGuire
(206), and Moscovici (226). 


SURVEYS OF THE ATTITUDE AREA 


MONOGRAPHS One of the most discussed monographs published during the 
period of review was Attribution: Perceiving the causes of behavior, by Jones et al 
(151), which includes a number of provocative chapters. This important work is 
examined at several points later and need not be detailed here. Triandis (311) also 
has published an excellent brief review of the literature on attitude and attitude 
change, although there are others at a lower level (9, 106). Kiesler (171) and Duval 
& Wicklund (77) published research monographs on, respectively, commitment 
and objective self-awareness, each with rather extensive theoretical discussion. 
Rokeach (257) has published another volume in his continuing study of human 
values. Dawes’ (67) thorough introduction to attitude measurement should prove 
useful in a variety of educational settings. At a substantially higher level is the 
monograph on multidimensional scaling, edited by Shepard, Romney & Nerlove 
(282). 


1 This review covers the period from January 1, 1971, to December 31, 1973, although we
tried to include especially important articles published in the first quarter of 1974. For the 
most part we have not included preprints or unpublished work, and certainly made no 
systematic attempt to gather such material. Our debt to McGuire and Sears & Abeles for 
previous Annual Review chapters will be obvious to the reader.

2 Preparation of this chapter was facilitated by a grant to the senior author from the 
National Science Foundation GS-29722X2.


SOCIAL PSYCHOLOGY TEXTS Seven good new texts on social psychology appeared
during our review. We are overcome by a plethora of riches. For slightly different 
reasons, we like best the works of Aronson (15) and Wrightsman (337, 338), 
although it is surely a matter of taste, since the others are very good by any 
standards. McClintock (204) and Insko & Schopler (138) take a more experimen- 
tal approach, while Elms (84) and Sampson (271) emphasize issues of social 
relevance. McLaughlin (209) stresses learning and social behavior. The level at 
which these books are written varies substantially. Insko & Schopler’s (138) book, 
because of their inclusion of a number of reprinted articles interspersed with text, 
could be used for a more advanced course too. We note also solid books by Crano 
& Brewer (63) and Hendrick & Jones (126) on research methods in social psy- 
chology that are suitable for undergraduates. 


NEW JOURNALS Three new journals appeared on the scene: The Journal of 
Applied Social Psychology (a new JASP!), edited by Siegfried Streufert, Howard
Fromkin, and Joan Suppes; The European Journal of Social Psychology, edited by 
Mauk Mulder, Gustaf Jahoda, Serge Moscovici, and Peter Schönbach; and the 
Journal for the Theory of Social Behavior, edited by Paul Secord and Rom Harré. 
EJSP publishes especially good work, in our judgment, and is indicative of the 
progress of social psychology in Europe in recent years. The other two journals are 
competitive and illustrate the burgeoning nature of social psychology.


COLLECTIONS OF READINGS Ordinarily collections of readings have not been mentioned
in this series of reviews. However, since they may prove useful to the reader,
we will list a few. Probably most central to the study of attitude and opinion 
change are the readers edited by Himmelfarb & Eagly (130), Malec (196), 
Suedfeld (302), Thomas (307), Freedman, Carlsmith & Sears (94), King & 
McGinnies (180), and McGinnies & Ferster (205). Several other edited works are 
concerned with various social issues (43, 115, 134, 249). A book edited by Rappa- 
port & Summers (253) contains a number of original and thoughtful articles on 
human judgment and social interaction. 


RELATED WORK Triandis’ (312) book on subjective culture and the work of Glass 
& Singer (101) on urban stress should be of special interest to our readers. 
Thoughtful work on prejudice was published by Campbell (49), LeVine & 
Campbell (191), Coleman (55), and Jones (156). The last is a stimulating paper- 
back designed for use in an introductory course in social psychology. 

Experimental social psychologists are becoming much more interested in 
problems of program evaluation, a field logically intertwined with the study of 
attitude change. The interested reader is commended to the recent books by Caro 
(50), Rossi & Williams (266), Williams (331), Zaltman (342), and Zurcher & 
Bonjean (350), all of which are substantial additions to the field. 

Our readers might also be interested in books on interpersonal attraction (47, 
229); personality (222, 330); the social psychology of groups (17, 197, 343); the self 
(348); motivation and emotion (32, 54, 83, 321); and several books critical of 
theory and method in social psychology (70, 116, 141, 161, 286).


METHODOLOGICAL ISSUES


MEASUREMENT AND SCALE CONSTRUCTION Phifer (252) has shown that in the 
own-categories procedure an individual may select a disproportionate number of 
acceptable statements as a result of greater ability to discriminate among items 
that fall within the latitude of acceptance. Ostrom (241) has proposed a technique 
to increase the range of items sampled from the “universe of content,” but the 
procedure appears to rest heavily on obtaining a representative sample of subjects 
to generate items. 

The problem of interpretation and assessment of neutral attitudes has also 
received some attention. Edwards & Ostrom (81) have investigated the influence 
of underlying cognitive structure, based on the premise that the certainty with 
which neutral attitudes are held is a function of the homogeneity of information 
regarding the object. Kaplan (162a) has approached the ambivalence-indifference 
issue with a modification of the semantic differential measurement technique, 
where the problem is most apparent in the double labeling of the neutral midpoint 
of the scale. 

A new problem may exist for scales using bipolar adjectives. Reich (254) has 
found that the subjectivity of the judgment scale interacts with the subject's 
involvement with the issue. Subjects low on involvement tended to show greater
uncertainty when a subjective dimension was employed in a modified Thurstone 
categorization task. 

The most innovative recent approach to attitude measurement is the bogus 
pipeline paradigm presented by Jones & Sigall (153). For those unfamiliar with 
this ignis minimus, the procedure is one in which a subject is led to believe that a 
physiological index of his true attitude response tendency is obtained by a piece of 
sophisticated electronic equipment. The paradigm possesses the advantages of 
increased sensitivity of measurement (58) and the inhibition of the influence of 
social desirability (285). Although the extent to which the technique can be 
generally employed may be limited (240), its potential pitfalls appear to lie more 
with the user than with the procedure itself (154). 

Miron (221) has questioned the wide application of the semantic differential 
scale (and underlying factors) to specific semantic domains substantially removed 
from the original derivation. For an interesting discussion of semantic mediation, 
see Miron (220) and Osgood (239). 


ATTITUDE-BEHAVIOR RELATIONSHIP The historic problem of the “failure” of at- 
titudes to predict behavior continues to receive considerable attention. The most 
common tack to increase the predictability of behavior from attitude has been the 
“other variables” approach. In addition to the central attitude, measures are also 
obtained of related and possibly conflicting attitudes and of the individual’s 
perception of situational constraints which are thought to mediate or influence the 
attitude-behavior relationship. Wicker (325) found that the best predictor of 
behavior was the person’s estimate of his response given the occurrence of un- 
planned extraneous events. Weinstein (322) assessed attitudes toward the issue, 
toward potential action, and toward the combined referent and action stimuli,
with the latter measure yielding the highest correlation with behavior. In a
refinement of this approach, Rosen & Komorita (259) used perceived effectiveness 
of actions (to result in change) as weights for a series of actions which differed in 
level of behavioral commitment, and assessed subjects’ willingness to engage in 
behavior. By summing over the weighted behavior, an Action Potential Index was 
derived which correlated .59 with the behavioral criterion. Factors which have 
been included in the other variables approach to the attitude-behavior relation- 
ship are potential consequences of behavior (96), perception of social norms and 
reference group expectancies (2, 7), and personal norms (277). 
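
The weighting scheme described for the Action Potential Index can be illustrated with a minimal sketch. It assumes, as the text suggests but does not spell out, that the index is a simple sum of each action's willingness score multiplied by its perceived effectiveness. The function name, the rating values, and the three example actions are hypothetical and are not taken from Rosen & Komorita (259).

```python
from typing import Sequence


def action_potential_index(effectiveness: Sequence[float],
                           willingness: Sequence[float]) -> float:
    """Sum willingness to perform each action, weighted by its perceived effectiveness.

    Assumption: a plain weighted sum; the original scoring rules may differ.
    """
    if len(effectiveness) != len(willingness):
        raise ValueError("each action needs one effectiveness and one willingness score")
    return sum(e * w for e, w in zip(effectiveness, willingness))


# Hypothetical subject: three actions ordered by increasing behavioral commitment.
effectiveness = [1.0, 2.0, 3.0]   # perceived effectiveness of each action (illustrative)
willingness = [1, 1, 0]           # 1 = willing to engage, 0 = not willing (illustrative)

api = action_potential_index(effectiveness, willingness)
print(api)
```

Under this reading, a subject who endorses only low-commitment actions, or actions judged ineffective, receives a lower index than one who endorses actions seen as likely to produce change.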

The solution of the attitude-behavior problem by the “other variables” ap- 
proach may not be as straightforward as it appears. Sample & Warland (270) have 
found that the certainty with which the central attitude is held moderates the 
increase in predictability contributed by measuring other variables. Problems may 
also exist in deciding when a variable is an “other” variable. Rokeach & Kliejunas
(258) found that attitude toward the situation, cutting class, was a better predictor
of behavior than attitude toward the object, the specific instructor. In this case, the
attitude toward the situation could be considered the more basic construct, and 
illustrates the somewhat artificial distinctions among attitude, behavior, and re- 
lated variables (173). 


BEHAVIORAL INTENTION The term behavioral intention is not defined consis- 
tently in the literature. It has been used both as a predictor of behavior (in the 
sense of being a measure sensitive to other variables that may influence behavior) 
and as a dependent measure of behavior. Perhaps the most comprehensive assessment
of the role of behavioral intention in predicting behavior has been
conducted by Kothandapani (181), employing the issue of attitude toward birth 
control. Four verbal measures (Thurstone, Likert, Guttman, and Guilford) were
independently constructed for each of three components of attitude—feeling, 
belief, and intention to act. The twelve scales were administered to 50 users and 
nonusers and yielded a 12 × 12 multitrait-multimethod correlational matrix. The
three components of feeling, belief, and intention to act were confirmed by two 
factor-analytic techniques. Regardless of method of measurement, intention to act 
served as the best predictor of behavior. 
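
The layout of such a matrix can be made concrete with a short sketch. It only enumerates the twelve component-by-method scales and sorts the off-diagonal cells into the three kinds of comparison conventionally distinguished in multitrait-multimethod analysis. The labels and the classification function are illustrative assumptions; no correlations from Kothandapani (181) are reproduced.

```python
from itertools import combinations, product

# Three attitude components crossed with four scaling methods gives twelve scales.
COMPONENTS = ["feeling", "belief", "intention"]
METHODS = ["Thurstone", "Likert", "Guttman", "Guilford"]

scales = list(product(COMPONENTS, METHODS))  # each scale is a (component, method) pair


def cell_type(scale_a, scale_b):
    """Classify one off-diagonal cell of the matrix (the two scales must differ)."""
    same_component = scale_a[0] == scale_b[0]
    same_method = scale_a[1] == scale_b[1]
    if same_component:
        return "monotrait-heteromethod"    # same component, different methods
    if same_method:
        return "heterotrait-monomethod"    # different components, same method
    return "heterotrait-heteromethod"      # different components and methods


# Count the cells of each kind in the lower triangle of the 12 x 12 matrix.
counts = {}
for a, b in combinations(scales, 2):
    kind = cell_type(a, b)
    counts[kind] = counts.get(kind, 0) + 1

print(len(scales), counts)
```

Evidence that feeling, belief, and intention are distinct components would show up as monotrait-heteromethod correlations that exceed the correlations in the other two kinds of cell.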

Much of the work regarding behavioral intention has been done by Fishbein 
and his associates (e.g. 5) and has been summarized by Ajzen & Fishbein (6). 
According to Fishbein’s model, an individual’s attitude toward the act (Aact) and 
motivation to comply with normative expectancies of others [NB(Mc)] are the 
predictors of behavioral intention (BI). Behavioral intentions are presumed to 
correlate highly with behavior as denoted in the symbolic representation of the 
model (B ~ BI). Ajzen & Fishbein state that “the more abstract or generalized the 
measure of intention the lower will be its correlation with a specific behavior.” 
Consistent with this statement, the studies which they review show correlations of 
BI with B to range from .21 to .97. The model as it now stands has received
criticism from Schwartz (277) for its exclusion of personal normative beliefs as a 
predictor of BI. At a more fundamental level, Smith & Clark (289) take issue with 
the formulation of attitude as a function of attributed characteristics and belief
strength ($A_o = \sum B_i a_i$), finding that an individual component ($a_i$) predicted $A_o$
better than $\sum B_i a_i$. They conclude that the approach precludes the possibility of
rigorously sorting out traits associated with the attitude object from those that are 
subsequently inferred and attributed to the object. 
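
The two formulations discussed above can be sketched in a few lines. The sketch assumes the usual reading of the model: behavioral intention is a weighted sum of attitude toward the act and of the normative belief multiplied by motivation to comply, and attitude toward an object is the sum of belief strengths multiplied by the evaluations of the attributed characteristics. The function names, weights, and ratings are hypothetical; in practice the weights are estimated empirically, not fixed in advance.

```python
from typing import Sequence


def attitude_toward_object(belief_strengths: Sequence[float],
                           evaluations: Sequence[float]) -> float:
    """Summation formula: each belief strength B_i times the evaluation a_i, summed."""
    return sum(b * a for b, a in zip(belief_strengths, evaluations))


def behavioral_intention(a_act: float, nb: float, mc: float,
                         w_attitude: float, w_norm: float) -> float:
    """Weighted sum of attitude toward the act and the normative term NB x Mc."""
    return w_attitude * a_act + w_norm * (nb * mc)


# Hypothetical ratings for one respondent.
a_o = attitude_toward_object([3, 1, 2], [2, -1, 1])
bi = behavioral_intention(a_act=2.0, nb=1.0, mc=3.0, w_attitude=0.5, w_norm=0.25)
print(a_o, bi)
```

The criticism attributed to Schwartz (277) amounts to adding a third weighted term for personal normative beliefs, while that of Smith & Clark (289) concerns whether the summed products in the first function predict better than the evaluations taken alone.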

As a final note on the attitude-behavior relationship, we find ourselves in 
agreement with Dillehay (72) “on the irrelevance of the classical negative evidence 
concerning the effect of attitudes on behavior.” Yet it is somewhat disconcerting 
that Bickman (34) reports interviewing over 500 individuals regarding personal 
responsibility to pick up litter when they see it and receiving affirmative responses 
for 94% of his sample, only to have less than 2% of the respondents pick up a piece 
of planted litter as they departed from the interview. 


SOCIAL PSYCHOLOGY OF THE EXPERIMENT Frequent references to methodological 
safeguards and the sometimes cautious interpretation of results reflect the general 
awareness and concern with the influence of subjects’ role, demand characteristics, 
and experimenter bias on the internal and external validity of experimental 
results. These variables are worthy of study in their own right, and the research on 
them probably helps us to become better experimental social psychologists. 
However, the data accrued under the rubric of the social psychology of the 
experiment do not offer a convincing explanation of other research. Of course, if 
one is predicting only main effects in an experiment, then alternative explanations 
are easy to come by, including the ones to be discussed in this section. 

Although it is not the most important methodological issue, interest in the 
nature of subjects’ role behavior in the experimental setting has instigated the 
most research. An excellent review by Weber & Cook (318) looks at the evidence 
for the most frequently hypothesized forms of such roles (the good subject, faithful 
subject, negative subject, and apprehensive subject) in a number of research 
paradigms and provides some resolution of the overall picture. Adair & Schachter 
(3) reexamined the conclusion of Sigall et al (284a) that subjects were motivated by 
evaluation apprehension to “look good” rather than to cooperate. When the 
experimenter’s expectation was not explicit, subjects showed an increase in per- 
formance, replicating the finding of Sigall et al. However, when the experimen- 
ter’s expectation was made explicit, subjects decreased performance. Adair & 
Schachter conclude that subjects tended to confirm the hypothesis they perceived 
as correct (although perception of correctness was assessed after behavior). How- 
ever, the design of the study does not allow discrimination between interpretations 
based on good subject or apprehensive subject behavior. Rosnow et al (262) 
employed an attitude change paradigm in which subjects were made aware of the 
demand to change by pretest cueing and found that when subjects believed 
yielding reflected negatively on their self image (indicated by a low IQ), there was 
less change than when the implication for self image was positive. The findings of 
this study are consistent with the conclusion of Weber & Cook that when the good 
subject and apprehensive subject roles are pitted against each other, the subject’s 
responses are consistent with those expected for evaluation apprehension. Rosnow
& Aiken (261) have taken a different tack in approaching the investigation of bias
in the psychological experiment. According to their formulation the manner in 
which a subject reacts to demand characteristics depends upon three mediating 
variables: (a) subject receptivity to demand; (b) motivation to comply in a positive,
negative, or neutral manner; and (c) ability to comply.


VOLUNTEERING In the Rosnow et al (262) study the subject samples included both
volunteers and nonvolunteers, but source of subject was not related to attitude 
change behavior. Hood & Back (133) have stated that subjects volunteer because 
of a cathartic need for self-disclosure. However, their data show the effect occurred 
only for males. Further, their data revealed the somewhat inconsistent finding that 
among volunteers there was a preference for a study involving competition rather 
than alternatives allowing more self-disclosure. Kruglanski (183) has proposed 
that the continued investment of resources in research investigating volunteer 
artifacts is unwarranted. He concludes that although volunteer artifacts may 
always be a potential source of bias, it has not been demonstrated that they act as a 
“unitary psychological variable” or as a pervasive source of bias. Kruglanski’s 
conclusion regarding the consistency and pervasiveness of the volunteer artifact 
has not gone undisputed (263). 


DECEPTION AND SUSPICION Rubin & Moore (267) have found that subjects in a 
social influence study showed a tendency to resist influence with increasing levels 
of suspicion. They also tested the effectiveness of an open-ended questionnaire for
detecting suspicion against a criterion of an extended debriefing interview. The 
questionnaire was effective in discriminating three levels of suspicion, and a form 
using an impression set was somewhat more effective than one using a simple 
recall approach.

The effects of a subject’s awareness of deception on experimental behavior have
received further attention from Cook & Perrin (57). Whereas subjects who had 
previously experienced deception tended to respond in a biased manner, those 
who had learned about deception in the abstract did not. When subjects were cued 
that the study might involve deception, the difference in performance between 
those who had experienced deception and those who had knowledge of deception 
was eliminated. An additional finding of interest was that suspiciousness and 
reported legitimacy of deception were positively related. In one of a series of 
studies investigating the role of deception in psychological research, Holmes & 
Bennett (132) found that awareness of possible deception did not influence 
subjects’ physiological or self-report responses to threat of shock. 

According to Newberry (230), a questionnaire was more effective than direct 
questioning in inducing subjects to admit to having foreknowledge on a problem 
solving task regardless of whether the approach was characterized by positive or 
negative demand. Those subjects who had received information about the task, 
including those who would admit it, used it to improve their performance al- 
though perhaps not intentionally. 

Judging from the distribution of research in this area, it is the subject who is
perceived to be the source of bias in psychological research. Research directed at
elucidating the variables that contribute to experimenter bias has received little 
recent attention. A notable exception is a study by Jones & Cooper (158), which 
showed experimenter-subject eye contact to be associated with subject affect 
which influenced responding on a person perception task. 


SUBJECTS’ PERCEPTIONS AND EXPECTATIONS OF THE EXPERIMENTAL SETTING From 
research in this area it appears that subjects do not find the use of deception and 
encountering certain noxious events (such as being shocked or asked personal 
questions) to be inappropriate (86, 300). However, this may not be the case if from 
the subject’s point of view the behavior in question is not justified or the research 
does not contribute to science. Sullivan & Deiker (303) have also found that a 
sample of psychologists responded more conservatively to questions of experi- 
mental ethics than did students. Those who read the report of the APA Ethics 
Committee (8) were already aware of psychologists’ conservatism in this area. 

One of the approaches used to obtain information on subjects’ reactions to
experimental settings is the simulated experiment. This method has also been 
presented as a device to circumvent the restrictions imposed by requirements for 
informed consent, by obtaining a subject’s consent on the basis of his reaction to a 
description of a similar study (33). However, today’s experimental simulation is 
yesterday’s role playing with a little less action, and some caution should be 
exercised in generalizing from the armchair speculations of college sophomores. 
For a review of role playing, see Miller (214). 


DISSONANCE 


FORCED COMPLIANCE Previous reviews have noted the controversy over the 
effects predicted by dissonance theory in the forced compliance paradigm (92, 206, 
278). Recent research has concentrated on investigation of the conditions under 
which the dissonance effect occurs in the forced compliance paradigm. Several 
studies seem worth noting in detail. All focus on the phenomenological aspects of 
the behavior. That is, assume a person has acted (or been induced to act) incon- 
sistently with his beliefs. The dissonance model predicts that the person will 
change this attitude to fit with his behavior to the extent that he freely chose to act, 
or was financially unrewarded for it, or perceived the behavior to be relatively 
unjustified on other grounds. Almost no investigator disagrees that such an effect 
occurs, but almost everyone agrees that the effect is substantially more limited 
than originally supposed. Recent research has focused on the dissonant behavior 
itself: whether it has negative consequences for self or others; whether the conse- 
quences were foreseeable or expected; whether the person felt responsible for the 
consequences or otherwise intended them to occur. 

Cooper (58) has suggested that dissonance occurs only when one feels person- 
ally responsible for his behavior. In his study, subjects chose (or were forced) to 
work with a partner with a negative personal characteristic which ultimately led to 
their joint failure on a task. All partners possessed a negative characteristic; but 
the degree of negativity was systematically varied, as well as whether the subject 
expected the partner to have that particular negative characteristic or a different 
one. The dependent variable was liking for the partner. Cooper found, as expect- 
ed, that the subject increased the perceived attractiveness of the partner only 
under high choice and when the negative trait had been expected or foreseeable, 
but especially when the trait was very negative. Cooper concludes that only when 
the subject could foresee, and therefore presumably feel personally responsible 
for, the negative implications of his choice did he experience dissonance. In a 
sequel to Cooper’s study, Worchel & Brand (334) varied the source of the subject’s 
expectation regarding the partner. They found the dissonance effect to occur only 
when the subject was responsible for his expectation (as opposed to the expecta- 
tion being based on a test score). 

Goethals & Cooper (102) varied choice and consequences of one’s behavior 
independently of one’s intention to produce those consequences. They found that 
if the subject chose to make a counterattitudinal speech, and if the listener were 
convinced by the speech, then the subject changed his own attitude to fit more 
closely with his behavior, regardless of whether he intended to convince the other 
or not. A second experiment, clarifying the manipulation of intention, replicated 
the basic results. Thus when subjects freely chose to engage in counterattitudinal 
behavior, they changed their attitudes only when the behavior had aversive 
consequences and regardless of whether the actor intended to effect those conse- 
quences. Cooper (58) and Worchel & Brand (334) emphasize that the perceived 
responsibility for one’s behavior is critical in producing dissonance, while Goeth- 
als & Cooper (102) stress the consequences of one’s behavior. The two lines of 
thought do not conflict, if one assumes that although the subjects of Goethals & 
Cooper unintentionally convinced the listener, they still might feel personally 
responsible for the effect. Indeed, Pallak, Sogin & Van Zante (246) found that nega- 
tive consequences increased task evaluation only under high choice and when sub- 
jects made an internal attribution of causality for the consequences of the choice. 

Collins & Hoyt (56) had subjects write a counterattitudinal essay, and inde- 
pendently varied financial inducement, consequences, and self-responsibility for 
the effects. Significant attitude change occurred only under low financial induce- 
ment and when the subject perceived that not only did his dissonant behavior have 
serious possible consequences, but that he would be personally responsible for 
them. This same basic result was also found after 14 days in a delayed posttest. In 
a separate study, Hoyt, Henley & Collins (136) found a similar interaction between 
choice and consequences. Collins & Hoyt had further predicted that without 
consequences or responsibility, incentive would have a positive relationship to 
attitude change, but this prediction was not borne out (indeed, the data were in the 
opposite direction). Nonetheless, Collins & Hoyt are to be commended for the 
inclusion of a delayed posttest as well as their unique attempt to manipulate 
independently both the consequences of subjects’ behavior and their perceived 
responsibility for those consequences. The reader may also be interested in the 
Collins & Hoyt article for its extensive review of the literature on the forced 
compliance paradigm [also see Tate (304) for an annotated bibliography of the 
literature on counterattitudinal advocacy]. 

Calder, Ross & Insko (48) varied choice, incentive, and consequences (and also 
included subject-observers in each variation). Their results are complicated. 
With serious consequences, there was an interaction of choice and incentive: high 
choice/low incentive subjects changed more, as expected, than high choice/high 
incentive subjects. The effect was reversed under low choice. There was also an 
interaction of choice and incentive with more trivial consequences, and the two 
contributing comparisons were opposite those found with serious consequences. 
In other words, high choice, low incentive and serious consequences were 
necessary to produce the dissonance effect, a finding that fits with other results. 
The implications are less clear regarding a positive relationship between incentive 
and change with combinational conditions of both high consequence/low choice 
and low consequence/high choice. 

The results from the subject-observers of Calder, Ross & Insko (48) are inter- 
esting, although their theoretical meaning is unclear. In general, observers’ guesses 
were more depressed than subjects’ responses (a main effect), although the general 
array of means was more or less the same. However, an inference of change is less 
clear in the case of observers. For example, consider their high choice/serious 
consequences conditions. For involved subjects the control mean was 13.20, the 
high incentive mean was 14.80, and the low incentive mean was 21.33. The 
inference that low incentive subjects changed their attitude is quite straightfor- 
ward. However, for observers, the three means are 11.00, 5.80, and 11.70. Even 
though the difference between incentive conditions is in the same direction (low 
greater than high) it is not at all clear that observers were engaging in the same 
process that involved subjects did. It seems instead that observers see the 
change occurring in the high incentive cell rather than the low incentive cell. It is 
commendable that Calder et al included observers in their design, but further 
work needs to be done before the meaning of this comparison becomes clear. 
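
The comparison in the preceding paragraph can be made concrete with a little arithmetic. The sketch below is illustrative only: it takes the six cell means quoted above and subtracts each group's control mean from its incentive means. Treating the raw difference from control as an index of inferred change is an assumption made here for illustration; the names `means` and `shifts_from_control` are hypothetical, and no variances or significance tests are modeled because none are reported in the text.

```python
# Cell means quoted in the text for the high choice/serious consequences
# conditions of Calder, Ross & Insko (48).
means = {
    "involved subjects": {"control": 13.20, "high incentive": 14.80, "low incentive": 21.33},
    "observers": {"control": 11.00, "high incentive": 5.80, "low incentive": 11.70},
}


def shifts_from_control(cells):
    """Return each incentive cell's mean minus the control mean (2 decimals).

    Assumption: the raw difference from control is used as a rough index of
    inferred change; this is an illustration, not the authors' analysis.
    """
    control = cells["control"]
    return {
        name: round(value - control, 2)
        for name, value in cells.items()
        if name != "control"
    }


shifts = {group: shifts_from_control(cells) for group, cells in means.items()}

for group, by_cell in shifts.items():
    print(group, by_cell)
```

For involved subjects the large departure from control lies in the low incentive cell, whereas for observers it lies in the high incentive cell and runs in the negative direction, which is the asymmetry the paragraph above describes.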

Frey & Irle (95) varied choice, incentive, and whether the subject’s counteratti- 
tudinal essay would be anonymous or made public. In the public variation, the 
essays would be published with the subject’s name and would be used in classroom 
discussions in which the subject would be asked to defend himself. The public 
variation bears some resemblance to the manipulation of aversive consequences 
used by others. Consistent with other studies, Frey & Irle found dissonance effects 
only under conditions of publicity, high choice, and low incentive. Unlike some 
other studies, they also found increased attitude change when the essay was to be 
made public (aversive consequences) and the subject had no choice but received a 
high incentive. 

Typically in forced compliance studies the subject is induced to perform an act 
inconsistent with his beliefs. Darley & Cooper (66) conducted an experiment in 
which the subject was induced to refuse to act inconsistently [see Kiesler (171) for 
a description of a very similar study]. They reasoned that if the subject refused to 
comply, then the greater the money offered (and refused) the greater the disson- 
ance. Having rejected the money, subjects should justify their behavior by devel- 
oping an even more extreme attitude, thereby producing greater discrepancy 
between attitude and requested behavior. By altering the demands of the setting 
somewhat, Darley & Cooper were able to induce every subject to refuse to write a 
counterattitudinal essay. Subjects who refused $1.50 to comply became more 
extreme in their beliefs, and more so than controls or subjects who only refused 
50¢. As Darley & Cooper (and Kiesler) point out, there is some danger in extra- 
polating forced compliance results simplistically to applied settings. If applied 
social scientists attempt to change attitudes through the forced compliance tech- 
nique, they should be careful. If subjects refuse to comply, they may well develop 
more extreme attitudes than they previously held. 

These experiments on the forced compliance paradigm suggest that the most 
reasonable current statement of both the definition of dissonance and the condi- 
tions under which the dissonance effect will occur is the following. The person will 
experience and reduce dissonance if he performs an act inconsistent with his 
beliefs and if the act has aversive consequences for the person or others, and if the 
person perceives that he bears some personal responsibility for the act. The exact 
meaning of personal responsibility is unclear; the various experiments indicate 
that it cannot be simply equated with intentionality, choice, justification of finan- 
cial inducement, or expectation. For example, Calder, Ross & Insko (48) found 
dissonance effects only under low incentive and high choice. Neither variable 
alone produced the effect. If we presume that responsibility for one’s behavior is 
the mediating variable for dissonance-like effects, then neither choice nor incen- 
tive could be equated with responsibility. Future research will undoubtedly delve 
more deeply into this interesting question. 

Cooper & Duncan (60) worked on a different aspect of the definition of dis- 
sonance. They tried to test Aronson’s (13) idea that dissonance results from a 
violation of one’s self-image (e.g. lying to others produces dissonance because one 
is not the sort of person who lies to others). Cooper & Duncan gave positive, negative, or 
neutral personal feedback to subjects about self. Subjects then wrote a counterat- 
titudinal essay for either 50¢ or $2.50. Contrary to expectations based on the 
Aronson notion, manipulated self-esteem did not interact with incentive (al- 
though, of course, subjects receiving the smaller incentive showed greater attitude 
change). Subjects were further divided on self-reported self-esteem; again self- 
esteem did not significantly relate to attitude change. Indeed, contrary to 
expectation, there was a nonsignificant tendency for low self-esteem subjects to 
change more than those reporting high self-esteem. Aronson’s idea, that one’s 
concept of self supplies the cognitive implication (“I am not the sort of person who 
...”) that is violated by dissonant behavior, is intuitively appealing and certainly 
relates to the concept of self-responsibility for one’s behavior mentioned pre- 
viously. However, the study of Cooper & Duncan suggests that this theoretical 
cloak is ill-fitting. 


ALTERNATIVE MODES OF DISSONANCE REDUCTION Götz-Marchand (108) has 
completed three intriguing experiments which show that how one reduces disson- 
ance depends upon the alternative ways offered and when. For example, in one 
study, subjects were led to believe that their IQ score was considerably below their 
expectations. Subjects were given a questionnaire of five items on separate pages 
relating to five different ways of resolving the dissonant input. Conformity (of IQ 
estimate) and test derogation were presented either first or last (the rest of the 
sequence was constant). Subjects either completed the five items in order or looked 
at all five before beginning. Götz-Marchand found the dissonance reduction 
alternative emphasized depended heavily upon both order and knowledge of the 
sequence. When the conformity item was first in the sequence and subjects were 
unaware of the content of subsequent items, they changed their estimates of 
self-intelligence 10.1 units; when they knew the subsequent items they changed 4.7 
units (p < .01). The results for the test derogation item show a similar interaction 
pattern (p < .01). How subjects reduced dissonance in this study depended greatly 
on which alternative methods seemed readily available. When they were not 
aware of subsequent items, they emphasized the first one presented to them. In 
other studies, similar effects occurred when the subject was allowed to change his 
answers or when one item was arbitrarily deleted from the list. This dissertation 
illustrates how cognitively flexible people are in resolving dissonant situations and 
how little we know about such processes. At minimum, these results suggest future 
studies should use much more sophisticated research designs. 


DISSONANCE AND OTHER VARIABLES In the period covered by this review, most of 
the empirical work in dissonance has focused on either the forced compliance 
paradigm described earlier or on comparisons with attribution theory, yet to be 
described. Less concentrated, but noteworthy, effort has gone into the study of: 
temptation (187, 347a); decision-making (22, 128, 193, 198); the consistency of 
dissonance effects across experiments (310) and other long-term effects (314); fait 
accompli (59) and other postdecisional inputs (127, 144, 339); behavioral success in 
conditioning others (52); the commitment or reaction of the audience (36, 112, 
287, 328); the effect of skill in role playing (201, 317); the relationship of disson- 
ance to social judgment theory and involvement (28, 30, 37); personality variables 
(29, 46, 284, 329); effort (140, 251, 279); disconfirmed expectancies (41); inequity 
theory (225, 316); and declining to perform alternative behaviors (121, 216, 218, 
309). 


SUMMARY Dissonance research has concentrated on discovering the critical var- 
iables underlying the forced compliance effect. Other research yet to be described 
has been concerned with comparing dissonance and attribution predictions. The 
sheer amount of research on dissonance theory has decreased quite dramatically 
over that contained in previous reviews, and the excitement generated by disson- 
ance theory in social psychology has obviously been dampened. The hope of a 
previous review that, “... the spectacular though currently unrealized promise of 
dissonance theory does not go the way of the brontosaurus” (278) has not been 
completely violated, but the theory might be put on the endangered species list. 


ATTRIBUTION THEORY 


Attribution theory (25, 124, 149, 164, 165) appears to have replaced dissonance 
theory as the most popular theoretical model in social psychology. Further, the 
Dissertation Prize offered by the Society of Experimental Social Psychology was 
won in 1972 and 1973 for work generally related to attribution theory. Newtson 
(232) studied the implications for the attribution process of variations in the units 
of perception. Snyder (292) developed a scale to measure the degree to which 
individuals monitor and change their own behavior as a function of situational 
variations. Attribution theory is making its contributions to experimental work in 
attitude change in both quantity and quality. 

The sheer frequency of appearance of attributional analyses in the literature 
serves to document literally the heuristic value of the approach. Attribution theory 
has indeed served to stimulate investigation, and represents a goodly portion of 
the empirical excitement in social psychology today. Overall, however, the com- 
ments of the last reviewers still appear to hold: “... few of the recently published 
studies seem to be direct extensions or tests of the underlying theory. Instead, most 
of these studies seem to be based on intuitive hypotheses, and their relationships to 
a systematic theory of attribution are usually left unstated, although there are 
some notable exceptions” (92, p. 502). 

Aside from direct tests of attribution theory, its status qua theory is still ques- 
tionable. Phenomenological analysis and the empirical study of attributional 
contingencies are legitimate approaches and of interest in their own right. On the 
other hand, to focus exclusively on experience and observation is to ignore both 
science and theory. In the present review we emphasize those studies which 
explicitly or implicitly help to illuminate the theoretical underpinnings of attri- 
butional psychology. In the history of psychology, confrontation among theories 
has produced significant advances in knowledge and should be a sign of scientifi- 
cally healthy and exciting times. Consequently, direct attempts to test attribution 
theory and other theories are especially important. “Other theories” thus far have 
consisted mainly of dissonance theory, but there is no reason why theoretical 
confrontations should be so limited in the future. We also emphasize studies which 
reflect upon variables or issues unique to attribution theory and thereby help to 
clarify or explicate the theory, such as observer/actor differences. Lastly, we note 
that this is a review of attitude change, theoretically an attribution or inference 
about self. We therefore do not attempt to review thoroughly the multitude of 
studies in which attributions to others are the primary or sole dependent measure. 


Theoretical Issues 


PRETEST SALIENCE For reasons that are not clear to us, several investigators have 
seized upon the salience of one’s pretest attitude as a variable for which theoretical 
distinctions can be made between dissonance and attribution theory. Bem & 
McConnell (27) found that in a forced compliance experiment, subjects’ recall of 
their pretest attitude was very similar to that shown on a posttest. Bem & 
McConnell concluded that their subjects did not experience dissonance between 
pretest attitude and counterattitudinal behavior, but rather simply inferred what 
their attitude must be on the basis of their behavior and its contingencies. An 
obvious alternative explanation of their results is that subjects did not want to 
admit to others, and possibly to self, that they had changed their attitudes. Subjects 
might not necessarily even be aware that they had changed; Goethals & Nelson 
(103) found, in a different setting, that subjects’ perceived change and actual 
change did not correspond. 

Several investigators (265, 293, 333) have posited that salience of pretest attitude 
is a variable about which dissonance theory and attribution theory make different 
predictions. A more salient pretest attitude, these investigators argue, should 
increase (dissonance and hence) attitude change from the point of view of dis- 
sonance theory, but decrease attitude change from the point of view of attribution 
theory. We do not completely agree with the theoretical prediction. To be sure, if 
one increases the salience of a pretest attitude, then one ordinarily increases the 
dissonance between that attitude and subsequent counterattitudinal behavior. 
However, this does not mean theoretically that more dissonance should be re- 
duced, since increasing the salience of the pretest attitude should also increase its 
resistance to change. The confound between two variables theoretically leading to 
opposite predictions means that the prediction from dissonance theory is proble- 
matical at best. 

The results still may have empirical utility. Snyder & Ebbesen (293) found that 
increasing the salience of the pretest attitude led to decreased change in the high 
dissonance condition, but also increased change in the low dissonance condition. 
They conclude against dissonance, but acknowledge that their results do not really 
fit with attribution theory either. They offer as explanation a variation of attribu- 
tion theory in which volition affects the perception of one’s behavior. Ross & 
Shulman (265) found that reinstatement of initial attitude did not decrease atti- 
tude change in a forced compliance experiment. Indeed the difference between 
their high and low choice conditions increased slightly when the initial attitude 
was made more salient. They conclude in favor of dissonance theory. Harris & 
Tamiler (118) found a similar result. These studies are of empirical interest since 
they provide clues about the process of change in the forced compliance paradigm. 
However, since the dissonance prediction is ambiguous, the setting does not 
provide the necessary conditions for a clear test between the two theories. 


THE ILLUSION OF UNIQUENESS Kelley (164) has suggested that subjects in high 
dissonance conditions perceive themselves to be uniquely agreeing to act contrary 
to their beliefs. In Kelley’s view the subject who complies under low incentive 
assumes (incorrectly) that he is unique in complying and infers therefore that the 
compliant behavior is actually consonant with true attitude. Cooper, Jones & 
Tuller (61) tested this proposition by telling some subjects that 90% of subjects had 
complied with the request to write a counterattitudinal essay and telling others 
that only 15-20% had complied. They found this manipulation of “uniqueness” to 
affect perceived consensus but it did not affect attitude change. Subjects receiving 
a small incentive for writing the essay changed their attitude more than subjects 
receiving a larger incentive, regardless of perceived uniqueness. Further, in 
control conditions in which uniqueness was not mentioned, perceived uniqueness 
did not vary as a function of incentive. 


Motivational Issues 


The areas of arousal and motivation seem to offer potential for differentiating 
between attribution theory and dissonance: attribution being a more dispassion- 
ate inferential process, while dissonance is a theoretical state of arousal which the 
individual presumably experiences as a negative state. 

Zanna & Cooper (346) gave placebos to subjects in a forced compliance exper- 
iment. Some subjects were told that the pills would make them feel tense; others 
were told it would relax them; control subjects were told the pills would have no 
side effects. Zanna & Cooper based their theoretical derivation on two assump- 
tions: (a) dissonance is a state of arousal; and (b) subjects can be led to misattri- 
bute states of arousal. If both assumptions are valid, Zanna & Cooper reasoned, 
then subjects could be misled into attributing the source of arousal to the pill 
rather than their own counterattitudinal behavior, and therefore not change their 
attitudes to reduce dissonance. Their data followed their theoretical predictions. 
Control subjects, who expected no side effects from the pill, showed a dissonance 
effect with high-choice subjects changing their attitudes to correspond with their 
behavior. When subjects could attribute their arousal to the pill, the dissonance 
effect was virtually eliminated. When subjects were told the pill would be relaxing, 
the high-choice subjects changed their attitudes significantly more than in the 
control condition. Zanna & Cooper conclude that such an array of data could not 
have occurred unless dissonance produced a state of arousal recognizable to 
subjects (otherwise subjects would not attribute the source of arousal to the pill). 
They say, “While Bem’s and Kelley’s model may be considered as useful heuristic 
devices, and while they may accurately reflect the process employed by 
observer-subjects, the present results suggest that involved subjects do indeed 
perceive themselves to be aroused when participating in a counterattitudinal 
role-playing situation” (346, p. 708). It should be noted that Zanna & Cooper do 
not directly test the hypothesis that dissonance is arousing, but rather test the 
implications of assuming that it is arousing. An indirect test, but a provocative 
result. 

Pallak (242, 245) has tested the dissonance-as-arousal hypothesis several times. 
In one study (245) he varied both shock and dissonance (separately but not 
orthogonally) and found dissonance and anxiety arousal (when subjects expected 
to be shocked in the future) to have very similar effects on incidental verbal 
learning, suggesting a parallel between anxiety and dissonance as drive states. 
Pallak & Pittman (245) found a significant interaction between level of dissonance 
and degree of response competition in two separate experiments. High dissonance 
subjects learned more than low dissonance subjects when response competition 
was low and learned less when response competition was high. This interaction 
with response competition has been repeatedly found with other drive states and 
supports the hypothesis that dissonance is arousing. 

The work of Zanna & Cooper and Pallak and his colleagues is sufficiently 
compelling that one must explain it in some way if one wishes to maintain a 
theoretical position of attribution as relatively “cool.” The more general issue is 
motivation. If one does attribute the cause of his behavior to self, why? If the 
person theoretically asks himself “Why did I do that?”, then what makes him ask 
the question? The research described above deals with the issue of the activity or 
arousal level of the person, addressing itself to one of the two classical issues on 
motivation, that of instigation to action. The other traditional issue of motivation, 
that of direction of behavior, has yet to be raised. One suspects that motivational 
issues, never a popular topic in social psychology in the past, may yet become a 
major concern in the future (172). 


Actor/Observer Differences 


In an important article, Jones & Nisbett (152) discuss the differing perceptions of 
actors and observers of behavior. “Actors,” they say, “tend to attribute the causes 
of their behavior to stimuli inherent in the situation, while observers tend to 
attribute behavior to stable dispositions of the actor.” This bias presumably is 
partly due to the actor’s greater knowledge of his own history, motives, and 
desires. However, there is also a directional bias in perceptual orientation, such 
that for the actor the situation is preeminent, whereas the observer focuses his 
attention on the actor. This statement has produced much discussion (see also 151, 
155, 203). 

Nisbett et al (234) conducted three different demonstrational studies which 
illustrate the different perspectives of actors and observers. In each case, actors 
were more inclined to attribute the cause of their own behavior to some aspect of 
the situation. Observers were more likely to see the cause as due to some predis- 
position of the actor. 

The issue of the differing perspectives of actors and observers was directly 
attacked by Storms (299). Storms reasoned that if attributions are literally in- 
fluenced by one’s focus of attention, then it should be possible to change the way 
actors and observers interpret a behavior by changing their visual orientation. In 
this study, two strangers (actors) participated in a brief conversation while two 
observers watched. Each of the observers was visually oriented toward one of the 
actors. Through ingenious use of videotape, Storms subsequently was able to show 
the interaction again to actors and observers, either from the same visual per- 
spective as before (i.e. actor sees other actor on tape and observer sees the same 
actor he previously was watching) or a different perspective (i.e. subject sees self 
on tape; observer sees the actor not previously the focus of his attention). In both 
the control (no-tape) condition and the same-orientation conditions, subjects 
attributed the causes of their own conversational behavior to situational influences 
more than did their observers, as expected. However, when their visual orientation 
was changed through the subsequent use of videotape, both subjects’ and ob- 
servers’ causal attributions of behavior were changed also. When subjects saw 
their own behavior on videotape, they became much more dispositional when 
explaining their own behavior. When videotape allowed observers the visual 
perspective of involved actors, they became much more situational in attributing 
cause to actors’ behavior. In short, Storms found that under ordinary circum- 
stances, actors tended to see their own behavior as situationally determined and 
observers see the same behavior as dispositionally determined. However, when 
Storms subsequently changed their visual perspective, he found the attributions 
for behavior to change also. 

Bem (25) originally proposed that observers and actors are “isomorphic,” that 
the inferential processes they use and the conclusions they arrive at are identical 
[although he no longer endorses this view in its extreme form (26)]. Jones et al 
(159) objected to both proposals regarding process and conclusion. Jones & Nis- 
bett now propose quite convincingly that observers and actors have different 
perceptual sets and hence infer different attributional causes. Storms has shown 
that the attribution by observers and actors can be quite similar if the perceptual 
sets are equated. Zanna & Cooper, among others, still maintain that the process, at 
least in dissonant situations, is quite different. The exact process of inference and 
the content of inference remain interesting empirical and theoretical issues. 


Attitudinal Attributions to Self 


In this section, we review the several studies in which the primary dependent 
measure is an attitudinal attribution to self. Since attitude change is the specific 
concern of this review, we take the point of view that studies involving attributions 
to self are more germane to the issue than those studies of attribution to others 
(although the latter are still of interest regarding theoretical issues). We note that 
the number of studies mentioned here does not necessarily circumscribe the 
interest in this topic, since a number of experiments previously described under 
the rubric of dissonance theory could as easily be explained by attribution theory. 

There are few attributional studies within the traditional attitude change para- 
digms (182, 344, 345, 347). Some work has been done on attributions regarding 
influencing others and being influenced by them (75, 103). Munson & Kiesler 
(228) also studied the role of attributions by others in the acceptance of persuasive 
communications. Other studies concern effects of real (219) or false physiological 
feedback (19, 100, 117, 260, 313, 332), and the perception of personal causation 
(18, 185, 190, 202, 264, 276, 336) particularly with reference to the interpretation of 
incentives and rewards (69, 184, 188, 235). 

Nisbett & Valins (235) present an interesting discussion of rewards and incen- 
tives. They propose an “overjustification” hypothesis, viz. that large incentives for 
consonant behavior can undermine one’s attitudes related to the behavior. “Just as 
the information-processing view maintains that subjects in the insufficient justifi- 
cation conditions of dissonance experiments discover stimulus causes for their 
behavior, it can predict that subjects in ‘overly sufficient’ conditions of ‘conson- 
ance’ experiments will ignore the stimulus causes for their behavior” (235, p. 68; 
quotes and italics in original).³ Lepper, Greene & Nisbett (188) tested this notion 
by having children draw pictures for which they received no reward, an expected 
reward, or an unexpected reward. Several days later the children were observed 
using the materials in a free-play situation. The percentage of time the subjects 
played with the target activity was taken as a measure of intrinsic motivation. 
Children who had previously received an expected reward spent less time in the 
target activity than either those in the control or unexpected reward conditions. 
Surprisingly, from the point of view of the reviewers, the unexpected reward did 
not affect time spent in the target activity. 

The overjustification hypothesis has interesting possibilities in the application of 
attribution theory to the study of incentives and rewards (38). Some existing data 
clearly are not in line with the hypothesis [(236); see also (238) for a review of 
token reinforcement programs in the classroom], which suggests a good deal of 
experimental work yet to be done. Reiss & Sushinsky (255a) find, for example, that 
the data of Lepper et al do not replicate if one uses a multiple trial, operant 
procedure. 

One of the most intriguing and theoretically unique findings of attribution 
research has been Bem’s (23, 24) research on truth and lie lights. Bem originally 
found that if a person sees himself making statements under circumstances pre- 
viously associated with telling the truth, he will come to believe that the statements 
represent fact. In our laboratory, we have failed three times to replicate this 
finding. A replication by Maslach (200) also does not support the original finding. 
Maslach discusses an alternative explanation, based upon decreased vigilance 
induced by the truth stimulus (also see 172). She found that the lie stimulus 
promoted more cautious responding as evidenced by such vigilance indices as 
better recall, greater confidence, slower reaction time, and a different pattern of 
physiological activity. The original conclusion is suspect at best. Maslach says, 
“Further experiments on false confessions should use different paradigms which 
build in a greater degree of mundane realism, and in which the act of confession 
has greater consequences for the subject” (p. 146). 


3 We find the hypothesis intriguing, but quibble with its roots in our own work. Kiesler & 
Sakumura (178; see 171 for related work) found that when subjects were induced to tape- 
record a speech consistent with their attitudes, incentives for doing so did not affect their 
attitude, but (as predicted) subjects receiving low incentives were more resistant to attack 
later. Nisbett & Valins describe an unpublished study by Carlsmith & Lepper, who report 
that subjects who received a small incentive for writing a consonant essay changed their 
attitude less in response to a counter-communication (than those receiving a larger incentive) 
regardless of whether the counter-communication came before or after writing the essay. 
This result fits neatly with Nisbett & Valins’ hypothesis, but we have not been able to 
replicate it in our laboratory. We find the resistance effect for low incentive to occur only 
when the counter-communication comes after the essay. We feel that our finding, although 
obviously related to the interpretations of our work on commitment, does not necessarily 
undermine the overjustification hypothesis in other settings or when the overjustification is 
more extreme. Our manipulations have been rather innocuous by design. One suspects that 
an overjustification would have to be much more extreme or obvious before the Nisbett- 
Valins effect would occur. Further, there is a substantial amount of evidence (114, 171, 177, 
269) that these commitment manipulations cue off an active process in the low-justifica- 
tion conditions (as opposed to an undermining of attitude in the high-justification condi- 
tions). 


Attributions to Others 


Research abounds in this area, but much of it is beyond a review of attitude and 
opinion change. In particular, the work on perceived ability of self and others 
continues to be a focus of experimental attention (e.g. 88-90, 97, 146, 168, 306, 308, 
319, 320). Weiner’s (321) book on motivation deserves special mention for its 
description of this work. 


CHOICE AND CORRESPONDENT INFERENCES An earlier study by Jones & Harris 
(150) found that subjects attributed an attitude to another consistent with his 
behavior even, although less so, when it was performed under low choice. Jones et 
al (155) found the attribution to low-choice others to depend upon the attitudinal 
explicitness of the behavior (essay strength in their experiment) and the prior 
attributions of the observer. When an actor wrote a weak essay under low choice, 
observers attributed an attitude to him opposite to his behavior. Lopes (195) shows 
how these data could be interpreted within the context of Anderson’s integration 
model (10, 129). Jellison & Davis (145) found that the extremity of another’s 
behavior also affects our perception of his competence, ability, and sincerity (also 
see 51, 166 and 210). Ajzen (4) found perceived probability of some behavior to 
depend upon both perceived choice and the utility of the behavior. Reisman & 
Schopler (255) point out that one could conceive of the person and his disposition 
to act as having a unit relationship and that the attributed responsibility for an act 
should vary as a function of the strength of the unit relationship. 


Victims and Responsibility 


Stokols & Schopler (298) found the evaluation of a victim to vary inversely with 
the severity of the victim’s misfortune and directly with expectations of future 
interaction. The victim’s responsibility for the outcome did not affect evaluations 
of her. Phares & Wilson (251a) found attributed responsibility to be a function not 
only of the severity of outcome but also of the ambiguity of the situation and the per- 
ceiver’s internal control (measured by I-E scale). Jones & Aronson (148) found no 
differences in fault attributed to rape and attempted rape victims, but divorced 
women were seen as more responsible for the act than either virgins or married 
women, and their subjects recommended shorter imprisonment for the victimizers 
of divorced women (which may say something about the liberation of Texas 
undergraduates). Shaw & Skolnick (281) found support for a refinement of both 
Lerner (189) and Walster (315) such that they predicted “victims” should not be 
seen as responsible for very positive outcomes since the “insular function” of 
attributed responsibility would be less. Fishbein & Ajzen (93) emphasize “con- 
textual levels” in their discussion of this study. In a developmental study Rule & 
Duker (268) found severity of outcome to be less attributionally important for 
older boys than younger ones (12 vs 7; also see 248). 


Interpersonal Simulations 


To the relief of us all, there are not many of these and, as might be expected, results 
are not consistent (76, 119, 162, 163). 


Related Work on Balance Theory 


In a series of three excellent studies, Crano & Cooper (64) tested Newcomb’s (231) 
distinction among positively balanced, positively imbalanced, and nonbalanced 
situations. Using all possible pairs of situations, subjects were asked to judge which 
situation of the pair was more stable and which more pleasant, thereby using each 
stimulus situation as a comparison point for every other stimulus situation. Al- 
though overall judgments were comparable to previous work, in no case was rated 
pleasantness of the triadic situation scalable, and pleasantness was concluded not 
to be unidimensional. Newcomb’s (231) extension of Heider’s (123) model re- 
garding nonbalanced triads was supported, particularly regarding perceived stabil- 
ity of the triad. In the third study, subjects rated the comprehensibility and 
involvingness of the triads. Situations having a positive P/O bond were rated as 
more involving (comprehensibility was not scalable), supporting Newcomb’s no- 
tion that a negative P/O bond motivates P to disengage from the situation directly 
with a resulting ambivalence toward the situation rather than neutrality. We note 
that Wellens & Thistlethwaite (323) did not find support for Newcomb’s model 
when subjects were asked to estimate the value of missing relationships in a P-O-X 
triad. 

Other work dealt with distinguishing among perceived pleasantness, tension, 
and consistency, as dependent measures (113); the recall of balanced and unbal- 
anced structures (62, 137); similarity between perceiver and actor (105); cross- 
cultural investigations of balance (324); the salience of or preference for incon- 
sistency (125, 280); and further work on the role of ambivalent and neutral 
relationships in dyadic structures (215). Although beyond the scope of this review, 
we note also the detailed work by Anderson and his colleagues, criticizing balance 
theory for the simplicity of its predictions regarding interpersonal perception. A 
number of interesting tests of Anderson’s integration theory have appeared 
(10-12, 129, 272). 


Summary 


Attribution theory has blossomed as a frame of reference for a wide variety of 
research enterprises. However, the theoretical details of this approach are still not 
well articulated, and to the extent that its proponents maintain their inductive bias, 
they may never be. Inductively oriented theorists have never been known for 
challenging or testing their own assumptions. Partly as a result, there really seem 
to be several theories of attribution. To sharpen our understanding of the attri- 
bution process, it would be helpful to bring the approach into more frequent 
experimental confrontation with other theories. We do not take the position that 
theory A is right and theory B wrong. Surely both have a germ of truth that 
resonates to all of us, but only through a process of theoretical confrontation can 
we separate the wheat from the chaff and discover the touch of reality that each 
represents. Some claim there is no such thing as a critical test of a theory and they 
are probably right, but we certainly progress more quickly when we try to confront 
theories with their own ambiguities. As yet, research on attribution to self is in the 
infant stage, and the theory has been used more often to explain other data than to 
generate interesting predictions on its own. However, the recent monograph by 
Jones et al (151) is promising in that regard. 

The work on attributions to self is especially important theoretically from 
several perspectives, even though some of this research is only beginning. For 
example, Weiner (321) discusses how our causal attribution for our own per- 
formance affects our subsequent behavior. The idea that causal attributions for 
previous behavior affect the relationship between attitude and behavior deserves 
serious consideration and should be tied to previous work on the attitude/behav- 
ior problem. Valins & Nisbett (313) discuss how reattribution therapy can be 
effective in inducing substantial change in patients’ behavior. Research on attitude 
change has always been weakest in its implications for gross behavioral change, 
and the thoughtful work of Valins & Nisbett is a step in the right direction. 

Attribution theory has not yet been tied closely to previous work in attitude 
change. Part of the problem is that the question of change itself is ambiguous 
within the theory. One who espouses an extremist attributional position would 
maintain that we don’t change our attitudes but rather simply discover what they 
are. This implicit denial of change not only runs counter to a substantial body of 
work on assessment of attitudes (and motives), but more importantly leaves the 
theorist free to ignore any data that are even remotely challenging. This isola- 
tionist position may please those studying for their preliminary examination, but 
disappoint those who enjoy the excitement of a theoretical or empirical confron- 
tation. 


LEARNING AND CONDITIONING OF ATTITUDES 


The conditioning of attitudes continues to be a popular research topic (85, 295). 
O’Donnell & Brown (237) found conditioning effects of both positive and negative 
attitudes to increase with age (grades 3-12), although a part correlation suggested 
the effect was due to increased awareness of the contingencies. With awareness 
constant, the part correlation between conditioning and age was -.11. Brown & 
Hill (44) suggest that task complexity mediates the relationship between aware- 
ness and conditioning. Kerpelman & Himmelfarb (169) applied the learning 
theory principles of acquisition, partial reinforcement, and counter-conditioning 
to the learning of attitudes in a concept-formation paradigm. Partially rewarded 
subjects were slower to learn the attitude, but more resistant to counter-condi- 
tioning, as predicted. Nisan (233) found that delayed rewards were perceived as 
more valuable than more immediately expected ones, although the relationship of 
this finding to Mischel’s (222) work on delay of gratification needs to be explored. 
Lohr & Staats (194) demonstrated the generality of their language conditioning 
paradigm by successfully demonstrating conditioning in three Sino-Tibetan lan- 
guages. 

Several interesting studies are less tightly tied to the learning/conditioning 
paradigm. Baron, Bass & Vietze (20) carried out a field study with lower class black 
youth and found, as predicted, that lower levels of praise from a white experi- 
menter were more effective in changing subjects’ self-image than were higher 
levels. They explain their effect in terms of subjects’ normative expectations of 
reward and the experimenter’s lessened credibility when he deviated from ex- 
pected reward levels. This idea of deviation from expected interaction rates could 
be applied also to other work. For example, Stang (296) found that interaction rate 
had a positive relationship to leadership ratings, but a curvilinear relationship to 
interpersonal attraction. Zillman (349) found the use of rhetorical questions (pre- 
sumably producing agreeing responses) to be more effective in persuasion than the 
more typical declarative statements (also see 273). 


ATTITUDINAL EFFECTS OF “MERE” EXPOSURE Zajonc’s (340, 341) finding that the 
repeated exposure of a stimulus enhances its attractiveness continues to provoke 
discussion. Burgess & Sales (45) explain the effect in terms of classical condition- 
ing and suggest that contextual factors (e.g. attitude toward the experimenter or 
the laboratory) act as higher-order unconditioned stimuli. In two studies they 
found exposure contexts and frequencies to interact. Stimuli presented in positive 
contexts became more positive with increasing exposure, while stimuli in negative 
contexts became more negative with increasing frequency of exposure. A similar 
finding was also obtained by Perlman & Oskamp (250) and Brickman et al (42). 


SOCIAL JUDGMENT THEORY 


The most important paper in this area reported a series of five studies by Sherif et 
al (283) on involvement. In Study I, they found highly involved subjects to use 
fewer freely chosen categories to describe their attitudes, to reject more beliefs, 
and to be noncommittal toward fewer statements (than were less involved subjects 
in each case). In Study II, a field study, involved subjects used fewer categories to 
describe their attitudes towards objects higher in the priorities of their reference 
group. In Study III, subjects judged fewer communicators as credible regarding 
involving issues than uninvolving issues. Sherif et al conclude that the greater the 
involvement the more selective a person becomes in judging a communicator as 
credible and the more likely the subject will contrast a position discrepant from his 
own on that issue, thereby widening the gap between the communication and his 
own position. The results of Study IV indicated less attitude change in response to 
a communication on an involving topic than to a communication on a less in- 
volving topic. In Study V, it was found that degree of involvement, but not 
attitude, predicted response to a request for acting on one’s attitude. For example, 
63% of subjects attended a meeting on a highly involving topic (defined by 
own-categories procedures), 48% on a somewhat less involving issue, and 38% on a 
moderately involving issue. Taken together these studies represent a substantial 
advance in our knowledge of the relationship of attitude to ego-involvement (also 
see 82, 275, 301). 

Rhine & Polowniak (256) report finding an increase in degree of involvement 
following attitude change and note the relationship of involvement to the study of 
commitment. Halverson & Pallak (114) investigated the relationship of commit- 
ment and involvement directly by measuring the latitudes of acceptance, non- 
commitment, and rejection of subjects who had recorded a speech consistent with 
their opinion under conditions of public identification or anonymity. Committed 
(public identification) subjects decreased their latitude of noncommitment and 
increased both the latitudes of acceptance and rejection. Halverson & Pallak 
conclude that the effects of commitment on resistance to attack and susceptibility 
to extreme appeals (171) may be mediated by a process of cognitive restructuring. 


REACTANCE 


Brehm (39) hypothesized that when a person’s perceived freedom is threatened, he 
feels reactance, and acts in a manner to restore his perceived freedom. The basic 
reactance effect was replicated several times during the period of this review and 
seems very reliable (40, 71, 110, 124, 243, 335), although one failure to replicate 
was noted (74). Several investigators demonstrated conditions under which the 
resistance effect may be attenuated, such as: when the restriction in freedom is 
implemented by a social norm (110); when the subject is committed to future 
interaction with the person restricting his freedom (243); and when the person is 
prevented from inferring that the other intended to restrict his freedom (124). 
Various interesting extensions of the model were tested. Berkowitz (31) inter- 
prets people’s unwillingness to respond to requests for help as being due to 
reactance and relates the reactance model to a wide variety of studies on helping 
behavior. Linder, Wortman & Brehm (193) found that the attractiveness of de- 
cision alternatives converges as the decision point approaches in time. This con- 
vergence should decrease reactance since apparently perceived choice increases 
when the difference in attractiveness between alternatives lessens (80, 120). Wor- 
chel & Brehm (335) found that reactance can be reduced almost vicariously. When 
another person under the same threat to freedom was observed to act so as to 
restore his freedom, the subject apparently felt little need to reduce his own 
reactance. Also, Brehm & Rozen (40) found that when an attractive new choice 
alternative was introduced to the subject, the attractiveness of old ones increased. 
Presumably, the subject felt his freedom to choose one of the old alternatives was 
threatened. 


SELF-AWARENESS 


Duval, Wicklund, and associates (77, 78, 192, 326, 327) present a very promising 
new theory of objective self-awareness. They treat the objects of conscious atten- 
tion as a dichotomy and suggest that at any given point in time either self or 
external events are the major focus of attention. In subjective self-awareness the 
person’s focus of attention is outward and evaluation of other people or objects 
takes place. In objective self-awareness (i.e. self as object) the person sees himself 
as a distinct event in the world, and is very sensitive to any incorrectness or 
inconsistency among his beliefs or behavior. Duval & Wicklund apply this notion 
rather broadly to such issues as conformity, dissonance, attribution theory, and 
social facilitation. 

Wicklund & Duval (326) found that subjects completing an attitude scale while 
listening to a tape of their own voice presented their opinions as more like their 
peers’ than did subjects listening to someone else’s voice. In the same paper, a 
study is reported in which subjects copying counterattitudinal material while 
confronting a television camera changed their opinion to reduce the attitude/ 
behavior discrepancy more than did subjects for whom the camera was turned 
away. Wicklund & Duval indicate that objective self-awareness can be seen as 
one mediator of dissonance reduction. In a third study, objective self-awareness 
is seen as mediating social facilitation. Subjects copying letters in front of a mirror 
copied more than subjects not facing a mirror. Duval & Wicklund (78) found the 
attribution of causality to self was increased when attention was focused on self, 
independently of whether the causal consequences were good or bad. Wicklund & 
Ickes (327) found that subjects making a decision while listening to a tape of their 
own voice requested more predecisional information and indicated they would be 
more certain of choice than did subjects hearing a recording of another’s voice. 
These studies of objective self-awareness are original and stimulating, and the 
reader is referred to the book by Duval & Wicklund for further discussion and 
detail. 


COMMITMENT 


Kiesler (171) has continued his work in the field of commitment to consonant 
behavior. His monograph summarizes the work to that date, describes six new 
experiments, and outlines the relationship of commitment to attribution notions 
and the writing of Kurt Lewin. As mentioned previously, the process of commit- 
ment is not easily seen as a simple one of attribution to self. Several experiments 
are related to this issue. Halverson & Pallak (114) found publicly committed 
subjects to decrease their latitude of noncommitment in the absence of other 
manipulations. Hoyt & Centers (135) found that if subjects were allowed to write 
any essay they wished, publicly committed subjects chose to espouse a more 
moderate position than did subjects writing anonymously. Pallak et al (244) found 
increasing commitment led one to be more responsive to an extreme but conso- 
nant communication. Kiesler, Pallak & Archer (176) found this effect to be limited 
to legitimate communicators. When faced with an illegitimate communicator, 
committed subjects responded negatively and increased the distance between own 
attitude and that of the speaker, compared to less committed subjects, regardless 
of the position advocated by the communicator. Salancik & Kiesler (269), in a 
study of commitment and the learning and retention of attitudinally related 
word-pairs, found committed subjects to recall a significantly greater number of 
consistent word-pairs and significantly fewer inconsistent and irrelevant word- 
pairs than did less committed subjects (also see 291). They suggest that commit- 
ment can affect both the salience of certain cognitions and their relatedness. 
Kiesler, Roth & Pallak (177) found committed subjects to actively avoid thinking 
about the attitudinal implications of their behavior, a process that belies a simple 
inference following behavior. In two other studies (174, 175) committed subjects 
were more likely to act on their beliefs, but only if attacked. Jones & Kiesler (157) 
found in three studies that committed subjects responded negatively (boomerang 
effect) to forewarning of attack, while uncommitted subjects showed anticipatory 
attitude change in the direction of the expected attack. 


STIMULUS INCONGRUITY THEORY 


Kiesler (172) has proposed a motivational theory of stimulus incongruity to ac- 
count for data on dissonance, attribution, reactance, commitment, and self- 
awareness. His model is based on previous work in perception, attention and 
vigilance, information processing, and ecological psychology. Others have shown 
that organisms become aroused when faced with incongruous stimuli (those which 
are novel, distinctive, or unusual). Kiesler hypothesizes that people become 
aroused when faced with such incongruous stimuli, whether the stimulus is physi- 
calistic, their own behavior, or someone else’s behavior. The aroused organism is 
presumed to reduce the incongruity by identifying or “explaining” the incon- 
gruous behavior. Thus the instigation to act and the direction of the action are 
central components of the theory. One’s own behavior could appear less incon- 
gruous to self through such means as attitude change, distortion of volitional 
contingencies, distraction, or misattribution of the arousal state. 


PERSONALITY VARIABLES 


SELF-ESTEEM Dinner, Lewkowicz & Cooper (73) followed up on the previous 
work by McGuire & Millman (208) on anticipatory attitude change. Dinner et al 
found high self-esteem subjects to show more anticipatory attitude change when 
forewarned of a communication on a familiar issue, while low self-esteem subjects 
changed more when expecting a communication on an unfamiliar issue. Dinner et 
al point to the unclarity in McGuire & Millman’s original distinction between 
emotional and technical issues (see also 68, 157) and suggest their concept of 
familiarity with the issue is easier to work with. 

In a more traditional line of research, Skolnick & Heslin (288) argue that the 
relationship between self-esteem and persuasibility should depend on the quality 
of the arguments in a communication. Using arguments contained in com- 
munications in other studies, they were able to show that arguments produc- 
ing an inverted-U or positive relationship between self-esteem and persuasibility 
in other research were rated as more persuasive, valid, and logical than were 
arguments from studies in which a negative relationship was found. They found 
no differences in rated difficulty or complexity of the arguments, leading them to 
conclude that quality of the persuasive communication is a more potent predictor 
of the relationship between self-esteem and opinion change than is difficulty or 
complexity of the communication. 

Eagly & Acksen (79) were concerned with the bases for change in self-esteem. 
They found that ordinarily favorable information about self was accepted more 
readily than unfavorable information, but that this effect was reversed when the 
subject expected further evaluations in the future (also see 223). Presumably the 
potential consequences of changing beliefs about self determined one’s receptivity 
to evaluative information about self. For a review of the literature on our evalua- 
tions of others as a function of own self-esteem, see Jones (160). 


DOGMATISM Hodges & Byrne (131) suggest that one reason we respond so neg- 
atively to dissimilar attitudes is that they challenge our competence. They hy- 
pothesized that simply changing the wording of an attitude statement to be 
phrased more dogmatically should increase the threatening nature of disagreeing 
attitudes. In two different studies, strangers either agreed or disagreed with the 
subject, and phrased their beliefs either as dogmatic or openminded. In both 
experiments the phrasing had little effect on perceived attractiveness of attitudi- 
nally similar strangers, but strangers with different attitudes were rejected more 
when their attitudes were dogmatically phrased. 

In a study which lasted over a 10-week interaction period, Jacoby (143) found 
that people low in dogmatism were significantly more accurate in perceiving 
co-workers’ dogmatism score than were those high in dogmatism. Apparently, 
dogmatic people are unable to improve their accuracy of perceiving others over 
time. (For other studies related to dogmatism, see 65, 87, 142, 224, 290.) 

In a study of values (Rokeach Value Survey), Starck (297) found that one’s 
preferences for information sources varied as a function of specific values. Al- 
though the majority of respondents preferred interpersonal sources of informa- 
tion (family, friends), those valuing “inner harmony” preferred printed sources of 
information (over electronic sources and public events), while those valuing a 
“comfortable life” preferred electronic sources. Either this says something about 
middle-class America, or else it is the final word on the real decibel level of TV 
commercials. 


SOURCE OR COMMUNICATOR EFFECTS 


Mills & Harvey (217) tested Kelman’s (167) conceptualization of the processes of 
identification and internalization. In identification, one theoretically accepts an 
influence attempt because one likes the communicator and wants to be like him. In 
internalization, one accepts the influence attempt because the change is congruent 
with belief and is itself rewarding. The former, Mills & Harvey say, implies an 
attractive communicator, and the latter implies a credible communicator. They 
suggest the timing of information about the communicator would be critical in 
acceptance of his message. As predicted, it made little difference whether 
information regarding the attractiveness of the communicator came before or 
after the message. However, an expert communicator was effective only if his 
expertise was known prior to receipt of his message. The data are interpreted as 
support for Kelman’s distinction between the processes of internalization and 
identification. 

Ference (91) set up an experimental situation in which the subjects received 
information from two different sources on each of several trials. He varied both 
the conflict between the sources and their relative accuracy (ability). Subjects’ 
acceptance of information (attitude change) from the sources varied as a joint 
function of both source conflict and accuracy. Ference’s paradigm is a neat one for 
studying source effects. Subjects were asked to make judgments regarding admis- 
sions of new students to the university. Actual cases were used, with all applicants 
being accepted but some failing to graduate. Subject’s task was to determine who 
had failed. Letters of evaluation were included from two different sources who had 
interviewed each applicant. The dependent measure was the extent to which 
subject’s response approximated each source. It is a plausible paradigm for man- 
ipulating both source and message variables. 

Landy (186) varied the attractiveness of an audience and its reaction to a 
speaker. Subjects were more influenced by a speaker when they felt an attractive 
audience had listened to him, regardless of the reaction of the audience. Attitude 
change was greatest when an attractive audience applauded the speaker and 
(surprisingly) least when an unattractive audience jeered the speaker. The speaker 
was perceived more positively when listened to by an attractive audience. Landy 
speculates that one evaluates a persuasive communicator by knowing who else is 
paying attention to what is said. [See also (247) regarding the effect of immediacy 
of the interaction between speaker and relevant others.] Snyder & Rothbart (294) 
found the increased impact of an attractive communicator to be due more to 
subject’s liking for him than to his perceived credibility. Cialdini et al (53) found 
that when a person expects to interact with another on an issue, he will moderate 
his position prior to the interaction, although the effect was apparently only a 
strategic response and did not persist over time. 

This seems as good a context as any to mention the provocative chapter by 
Moscovici & Faucheux (227) on minority influence in the Berkowitz series. They 
criticize the experimental paradigm in which the group is seen as the active 
influencing agent and in which the individual or minority must conform to or 
deviate from the group’s prescriptions. They argue convincingly for research 
studying how and when minorities (or individuals) can influence majorities (or 
other groups), leading to a better understanding of innovation. 


FOREWARNING 


As mentioned, Dinner et al (73) found high self-esteem subjects to show more 
anticipatory attitude change when forewarned of a communication on a familiar 
topic, while low self-esteem subjects changed more when expecting a
communication on an unfamiliar topic. Deaux (68) suggests that anticipatory change is not
related to either high or low self-esteem (manipulated or measured), but rather 
occurs when one is uncertain of his self-esteem. Jones & Kiesler (157) found that when
subjects were uncommitted on an issue, forewarning of an attack led them to
abandon their belief. Committed subjects, on the other hand, intensified their 
beliefs when forewarned of attack. Jones & Kiesler found their interaction to hold 
in three different experiments and three different variations in commitment 
ranging from issues to direct manipulations of commitment. 

Hass & Linder (122) studied the interaction of forewarning and the order of 
arguments in one- and two-sided messages. Somewhat surprisingly, forewarned 
subjects responded negatively to the placement of implicit refutational arguments 
(versus pro-arguments) first in a countercommunication. Subjects who were not 
forewarned attended less to the sequence of the arguments. This result persisted 
over three studies. Perhaps the refutational arguments amounted to an inadequate 
attack which only aroused subjects’ defenses. 


INTERPERSONAL ATTRACTION 


Space limitations and personal inclinations preclude a thorough review of re- 
search on interpersonal attraction. We will only touch lightly on some issues that 
have interested readers of this series of reviews in the past. 


SIMILARITY AND ATTRACTION Byrne’s attraction/reinforcement paradigm has 
continued to receive empirical attention, and his monograph (47) systematically 
outlines the voluminous research in this area. Other workers have extended and 
modified the basic finding. 

Mitchell & Byrne (224) applied the attraction paradigm to the study of jury 
decisions. In a mock hearing, similarity of attitudes of juror and defendant directly
affected the perceived attractiveness of the defendant. For both authoritarians and
egalitarians, jurors’ attitude similarity increased the attractiveness of the defendant.
Interestingly, attitude similarity interacted with juror’s authoritarianism in
affecting recommended punishment. Authoritarians recommended significantly 
more severe punishment for a dissimilar other, while egalitarians showed no bias 
in recommended punishment. Within this restricted situation, attitudes toward the 
other and behavior toward him did not fit together neatly, at least for egalitarians. 

Mascaro & Graves (199) applied adaptation-level theory to the attraction par- 
adigm. Their subjects evaluated two stimulus people. Perceived similarity and 
attraction to the second stimulus person (50% attitude agreement) varied as a 
function of the attitudinal similarity of the first person (90%, 50% or 10% agree- 
ment). When the first person had been very similar to the subject, the second was 
perceived negatively in contrast in both perceived similarity and attraction. When 
the first stimulus person was very dissimilar, the second was perceived more 
positively. 

Other work found the basic similarity/attraction effect to depend at least 
somewhat on: self’s affective state (109); the basis of the attitude (21, 104, 170,
305); the basis of future interaction with the other (147); how explicitly the
potential evaluation of the other is suggested (139); the relevance of the situation 
for the subject (111, 213); and the salience of the attitudinal disagreements (107). 
As one last titillating note, Bleda (35) found that attitude similarity directly affected
female subjects’ perceptions of the height of the other; the greater the attitude 
similarity, the taller the other was perceived, regardless of the height of the subject. 


ORDER EFFECTS Mettee (211, 212) carried out two studies to extend and refine the 
well-known gain-loss model of attraction proposed by Aronson & Linder (14, 16; 
also see 154a). His findings indicate the effect depends upon both the degree of 
affect communicated in the separate parts of the sequence and the inferred dis- 
cernment of the other. Discernment is suggested to involve two potentially inde- 
pendent factors, credibility and discrimination of the other. Mettee showed that 
when self is evaluated in a negative/positive sequence on two separate tasks, 
attraction to the evaluator is greater than in a positive/positive sequence (much as
Aronson & Linder had found originally). However, the opposite effect occurs if 
the sequence of evaluations occurs within the same task. In the latter case, the 
other is presumably seen as a less credible source of information. 


PREDICTABILITY S. Kiesler (179) proposes that generally we dislike
unpredictable others. In two experiments, she demonstrated that (a) situational 
constraints can influence individual preferences for predictability or unpre- 
dictability in others; (b) one who acts inappropriately is perceived as relatively 
unpredictable; and (c) whether one dislikes the inappropriate, unpredictable 
other depends on specifiable situational variations. 


OTHER WORK We cannot leave this topic without commending to the reader the 
excellent little book on theories of attraction and love edited by Murstein (229) 
and containing excellent articles by Walster, Newcomb, and others, as well as a 
provocative critique by Murstein of research in the attraction paradigm. 


CONCLUSION 


We have continued a tradition of avoiding the definition of attitude and attitude 
change by simply reviewing that body of research that self-defined attitude re- 
searchers have carried out in the last 3 years. We were surprised, however, after a 
systematic review, to learn how little research had been done on a variety of issues 
that have received heavy attention in the past. To wit, astonishingly little research 
has been done recently on the classic breakdown of the persuasion setting: varia- 
bles relating to the communicator, the communication, and the recipient of the 
communication. There has been little investigation of fear appeals, order effects, 
the dissonance-discrepancy controversy, immunization, selective exposure, or at- 
titude change in natural settings. 

Why? Part of the problem may be the decline in popularity of dissonance
theory, part may be due to attacks on the attitude concept itself (1), and part to
dissatisfaction expressed by respected colleagues with deductive theory (98, 99)
and laboratory experiments (207).

McGuire (207) has recently criticized both theory-derived hypotheses and cur- 
rent research in the field. He argues for systems theory and multivariate correla- 
tional designs, and emphasizes hypothesis generation rather than hypothesis test- 
ing. Gergen (98, 99) suggests theories are primarily a reflection of contemporary 
history; that science and society form a feedback loop, such that science affects 
society and is thereby itself changed. Both have a point, of course. However, we 
disagree on several grounds. Schlenker (274) has pointed out the difference 
between empirical generalizations, so tied to our own culture, and abstract uni- 
versal propositions which underlie scientific theorizing. While we accept 
McGuire’s criticism of social psychological thinking as being distressingly uni- 
variate in tone, he has failed to consider that the multivariate systems approach to 
social problems is especially susceptible to influence or domination by cultural 
variables (and random error) which change over time. 

Systems theory, computer simulations, and inductive theorizing, while to be 
encouraged on other grounds, have a common fault: they don’t lead very easily to 
empirical tests of one’s thinking. One is reminded of the ancient enthusiasm about 
factor-analysis as a technique to handle one’s longing for the “big picture.” It is a
useful technique, although its conclusions are known to require caution. However,
factor-analysis, like systems theory, often capitalizes on the error variance associated
with a testing of a single instance. Like a computer simulation, it is limited by the 
sampling bias in the data thrown in the hopper. Like inductive theorizing, it
seldom leads to informative tests of one’s underlying assumptions. Space limits a 
more detailed consideration of these complicated and important issues here, but 
we expect these issues to be hotly contested in the near future. 

We, of course, are much more committed to the experimental method than 
McGuire (207) currently claims to be, and more appreciative of deductive theory 
and the mathematical logic that underlies it than Gergen (98, 99) admits to. 
However, we are sympathetic to the feelings behind their words—that as a field we 
have become too rigidly deductive and too totally laboratory oriented. Social 
psychology must surely swing back to a better balance between theory and ob- 
servation, between laboratory and social problems. Yin and Yang should be in 
balance, not in alternate cycles. Social psychology and the study of attitude change
are in the middle of that swing now, we think. However, a method/topic shift
demands new theory and new approaches, and we are only now pausing to ponder 
these issues. As a result, attitude change is not the thriving field it once was and will 
be again. 

The literature published since the previous review of this topic by Miner &
Dachler (65) shows several trends. There has been a dramatic decrease in the 
popularity of equity theory, especially with respect to explaining job and task 
performance. Since the Pritchard et al monograph (76), the few studies concerned
with equity have stressed its role in influencing pay satisfaction. Competent 
criticisms of equity theory’s vagueness (106) and limited applicability may have 
contributed to its current lack of popularity. Need for achievement is also 
encountering increasing criticism (25), and the validity of Herzberg’s theory
remains in doubt (56).

In the applied realm, job enrichment and Management by Objectives continue
to attract the attention of practitioners. But participatory management is the most 
frequently recommended panacea for problems pertaining to employee motiva- 
tion (61) despite equivocal evidence (18). 

In the theoretical realm, expectancy or V-I-E (valence-instrumentality-expec- 
tancy) theory has clearly become the most popular approach to motivation among 
industrial researchers (e.g. 48). However, there are still those who consider factor 
analysis and/or huge correlation matrices involving measurements of dozens 
of variables to be the key to unlocking the secrets of motivational psychology. 
(For this writer’s comments on the latter approach, see 52, 54, 56). Organiza- 
tional climate has also become popular, but the problem lies in determining just 
what the concept means (36). Space limitations prevent further discussion of this 
concept here, but for a thorough review of the topic see (37a). 

Two other areas which continue to attract the attention of researchers are pay 
(97) and the relationship between job attitudes and behavior (73). 

This article will be a selective review and critique of major theoretical ap- 
proaches and major research areas relevant to employee attitudes and motivation. 


1 The author would like to thank Dr. Harry Binswanger for his helpful suggestions and
ideas regarding some of the issues involved in the theory of psychological hedonism. 


EXPECTANCY (V-I-E) THEORY


Previous reviews of and commentaries on expectancy theory have identified many 
of the practical problems involved in testing the basic V-I-E model, e.g. the 
problem of how to measure the various components or which components to
measure; the need to include measures of role perceptions; the need to specify 
both intrinsic and extrinsic outcomes; the need for relevant measures of perfor- 
mance; and the problem of causality when interpreting the results of con- 
current studies, etc (20, 38, 65). 

One could add to this the need to control extraneous variables carefully when 
testing the explanatory power of expectancy theory. For example, correlations 
between expectancy and performance are typically computed without controlling 
for past performance level or for ability. Since self-perceptions of ability (which 
are presumably based on past performance and on actual ability) are correlated 
with performance (72), such controls are crucial if spurious expectancy-perfor- 
mance correlations are to be avoided. Only a few studies have shown evidence for 
expectancy effects with ability controlled (3, 70). Goal level is another variable 
that has not been adequately controlled in most V-I-E studies (19). 
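
The control problem described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the data are invented, ability is made to drive both expectancy and performance, and expectancy is given no direct effect on performance. The helper names `pearson` and `partial_corr` are hypothetical and are not drawn from any of the studies cited.

```python
import math
import random


def pearson(x, y):
    """Plain Pearson product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """First-order partial correlation of x and y with z held constant."""
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


# Hypothetical data: ability drives BOTH expectancy and performance;
# expectancy has no direct effect on performance in this simulation.
rng = random.Random(0)
ability = [rng.gauss(0, 1) for _ in range(2000)]
expectancy = [a + rng.gauss(0, 1) for a in ability]
performance = [a + rng.gauss(0, 1) for a in ability]

zero_order = pearson(expectancy, performance)
controlled = partial_corr(expectancy, performance, ability)
print(f"zero-order r = {zero_order:.2f}, partial r = {controlled:.2f}")
```

Under these assumptions the zero-order correlation should be substantial while the partial correlation should fall close to zero, which is the sense in which an uncontrolled expectancy-performance correlation can be spurious.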

An examination of the research reported since Miner & Dachler’s review reveals 
that many of these problems are still prevalent. Thus the conclusions to be drawn 
from it are necessarily equivocal. 

For example, there are no consistent findings regarding which components are 
the best predictors of performance. Sometimes V works best (77). Sometimes I 
works best (67). More often E works best (3, 29, 85). The total V-I-E formula may 
predict performance better than any component by itself, or it may not. An 
additive model may work better than a multiplicative one (85), or there may be no 
difference (68). There have been no convincing explanations offered as to why 
whatever does work works, or why whatever does not work does not. 
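
For readers unfamiliar with the distinction between the two combination rules, the following sketch may help. It assumes one common simplified form of the multiplicative model (expectancy times the summed products of instrumentality and valence) and one possible additive counterpart; neither formula is quoted from the studies cited, and the function names and ratings are hypothetical.

```python
def multiplicative_force(expectancy, outcomes):
    """E * sum(I_j * V_j): a zero expectancy wipes out the whole score."""
    return expectancy * sum(i * v for i, v in outcomes)


def additive_force(expectancy, outcomes):
    """E + sum(I_j + V_j): each component contributes independently."""
    return expectancy + sum(i + v for i, v in outcomes)


# Hypothetical ratings: (instrumentality, valence) pairs for two outcomes.
outcomes = [(0.8, 3.0), (0.5, 2.0)]
for e in (0.0, 0.5, 1.0):
    print(e, multiplicative_force(e, outcomes), additive_force(e, outcomes))
```

The two rules diverge most sharply when one component is near zero: the multiplicative score collapses, whereas the additive score is barely affected. With ratings that rarely approach zero, the two may rank individuals similarly, which is one conceivable reason the comparisons reported above are inconsistent.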

The expectancy model usually predicts self-ratings of effort, attitudes, and/or 
performance more successfully than it predicts supervisor’s performance ratings 
or objectively measured performance (4, 49, 66, 68, 77, 98). This immediately 
makes the former results suspect, since response bias could account for them. 

Generally the correlations between V-I-E constructs and objective performance 
(or supervisory ratings) in real-life settings are low (4-6, 20, 35, 66, 68, 77, 85, 86). 
The results using longitudinal designs in order to rule out the possibility of spur- 
ious concurrent correlations [e.g. individuals attempting to account logically for 
their own behavior after-the-fact (11)] are no better (49, 87). 

Dachler & Mobley have pointed out some limitations of expectancy theory by 
showing that its ability to predict actual performance may be restricted to situa- 
tions in which the contingencies between performance and work outcomes are 
clearly established and are clearly perceived by the employees (20, 21). Consistent 
with this hypothesis is the finding that expectancy theory tends to work best in 
situations where rewards are objectively geared to effort and performance, e.g. 
piece rate incentive systems (20, 44). In such situations performance-reward 
instrumentalities tend to be highest (16, 20, 75, 84). Even under these ideal
conditions, however, there are puzzling failures of the theory, such as low valence
subjects outperforming high valence subjects, despite instrumentality and ability 
being controlled (75). 

An increasingly popular area of research related to expectancy theory has been 
labeled “attribution theory,” and entails the study of the factors to which individ- 
uals attribute their successes and failures (107). Most such studies have focused 
on the determinants or effects of attributing success or failure on a task to: effort,
luck, ability, and/or task difficulty. There have been attempts to integrate attribution
research with Rotter’s concept of Internal vs External (I-E) locus of control
(62, 107). However, the precise meaning of the I-E scale is equivocal (107), though 
some studies have found that individuals high on I persist longer after failure than 
those high on E (109). The types of causal attributions an individual makes after 
success and failure are relevant to expectancy theory, since such attributions can 
influence the degree of affect experienced (and therefore valence) as well as 
subsequent expectancies. 

As noted earlier, most previous discussions of expectancy theory research have 
focused on design flaws in tests of the theory. While such methodological prob- 
lems have received ample attention, problems with expectancy theory itself have 
gone largely unnoticed or unmentioned until recently. 

This writer believes that a thorough critique of the basic assumptions of expec- 
tancy theory is not only appropriate but long overdue. Thus the remainder of this 
section will be devoted to it. It is hoped that this critique will stimulate thinking 
and discussion of the issues involved and will lead, at the very least, to a less naive 
approach to the study of human motivation in general and to the study of work 
motivation in particular. 


Critique of Expectancy Theory 


HEDONISM As formulated by industrial psychologists (111), expectancy theory is 
a form of calculative, psychological hedonism in which the ultimate motive of 
every human act is asserted to be the maximization of pleasure and/or the 
minimization of pain. The individual always chooses that course of action which 
he expects will lead to the greatest degree of pleasure or which will produce the 
smallest degree of pain. In Vroom’s model, which is the prototype of all expec- 
tancy theories in industrial psychology, the individual acts to maximize his “va- 
lences,” which Vroom defines as “expected satisfactions” (111, p. 15). On this issue 
it is worth pointing out that the theories of motivation from which V-I-E theory 
was derived (e.g. Lewin’s) did not make any explicit hedonistic assumptions (8).

There is a very crude sense in which it is true that individuals are motivated to 
attain pleasure and to avoid pain. All individuals (in fact, all animals) are born 
with an automatic sensory-level pleasure-pain mechanism. This mechanism 
operates to inform the organism that certain actions are compatible with its
survival and well-being or contrary to its welfare. This built-in mechanism is the
basis for the development of values and emotions which the individual subse- 
quently acquires through learning. 

While the body’s pleasure-pain mechanism provides the biological basis for the 
experience of valuing, it does not follow that individuals will necessarily choose
objects or actions solely on the basis of the degree of pleasure they expect their 
choices to bring. If this were true, we would expect that virtually everyone except 
the very poorest “calculators” would be happy. If a person failed to maximize his 
pleasure by taking one course of action, he would automatically switch to another 
until happiness was attained. The existence of neurosis, drug and alcohol addic- 
tion, suicides, and large numbers of chronically unhappy people would seem to 
belie this implication. 

Even more embarrassing for the theory of hedonism is the existence of individ- 
uals who deliberately renounce their own pleasure, e.g. because they believe 
pleasure to be evil, as with St. Francis of Assisi; or because they believe it is their 
duty to take actions which they dislike, as with some men who give up their chosen 
careers in order to enter the family business; or because they believe self-sacrifice 
to be virtuous, as with any practicing altruist. 

The hedonist usually attempts to answer such criticisms by arguing that these 
individuals must have decided that they would get more pleasure or less pain from 
the actions they chose to perform than they would have attained from more 
obviously self-serving activities. But here the hedonist becomes equivocal. He
assumes that simply because a person was motivated to act in a certain way, i.e.
because he wanted to do one thing rather than another, this fact alone proves
he was motivated by pleasure. Vroom himself uses the term “preference” as a
synonym for “anticipated satisfaction” (111, p. 15) on the implicit premise that 
only (anticipated) pleasure can be a basis for preference. 

If the hedonist were to argue that the existence of preferences as such was proof 
of the validity of the theory of hedonism, he would be, in effect, assuming in 
advance the validity of the theory which he was attempting to prove, i.e. the 
doctrine that choices are regulated by anticipated pleasure. In logic, this is known 
as the fallacy of question-begging. 

To avoid the accusation of circularity, the hedonist must prove, not that all 
behavior is motivated or that people have preferences, but that expected pleasure 
is the sole basis for all preferences. 

If he wished to assert instead that hedonism is not an hypothesis about moti- 
vation but an axiom (a fundamental assumption that forms the base of motiva- 
tional psychology and which cannot be opposed without self-contradiction), then 
it would be incumbent upon him to show why (or that) it was axiomatic. 

It should be added that if pleasure were defined so widely as to be taken as 
synonymous with the term “motivation” or “preference,” the theory would not be 
very useful in explaining individual differences in choices and actions. For exam- 
ple, even though both sexual activity and sexual abstinence are motivated (i.e. 
caused), there is a radical difference in the motivational basis of these differing 
actions, e.g. in the view each practitioner has of himself, of his own body, in the 
nature of his self-esteem, in his view of pleasure, etc. Even if one were to argue that 
an ascetic gets pleasure by abstaining from pleasure, it would have to be ac- 
knowledged that what the ascetic means by pleasure is radically different from what 
the nonascetic means by it. 

There is a related problem with the doctrine of hedonism which pertains to the
relationship between pleasure and pain. Calculations of expected pleasure under 
expectancy theory are made on the implicit premise that “units” of pleasure and 
pain are interchangeable, e.g. that eliminating 10 units of pain is equivalent to 
adding 10 units of pleasure. In short, it assumes that pleasure and pain differ only 
quantitatively, not qualitatively. 

This premise is difficult to reconcile with clinical and experimental observations 
to the effect that there is a fundamental difference in kind between motivation by 
positives and motivation by negatives. There are enormous qualitative differences 
between individuals who seek to attain values and those who seek to avoid losing 
them. Motivation by desire is not the same as motivation by fear (9). For example, 
a student who sets his grade aspirations at a very low level so that he will not be 
disappointed if he does poorly is guided by very different principles of choice than 
one who is motivated to attain the best grades that he can. And the experiences in 
the two cases are not the same. Relief is not the same as pleasure. The “pleasure” 
of drinking oneself into an alcoholic stupor in order to avoid thinking about one’s 
problems is not equivalent to the pleasure of working hard to succeed on a job that 
one loves. 

Thus, even if one were to grant the basic premise of hedonism, there would have 
to be at least two separate versions of it, one focused on attaining pleasure and the 
other on avoiding pain. Since, however, people can focus on both of these goals in 
varying degrees (and in varying degrees at different times), this would make for
several additional versions, each with its own guiding principles. 

Unfortunately, even the above modification would not suffice. It has been 
pointed out that individuals often act to “satisfice,” i.e. to attain some minimum 
level of satisfaction, rather than to maximize pleasure (11-13). Since presumably 
one can also “negafice” (avoid some maximum degree of dissatisfaction), these 
principles, combined with those above, would produce an enormous number of 
motivating principles (e.g. pleasure satisficers, pain minimizers, pleasure maxi- 
mizers, pain negaficers, combination pleasure satisficers and pain minimizers, etc). 
Such elaborations would lead to the virtual collapse of hedonism as a fundamental 
motivational principle. 
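
The difference between two of these principles can be made concrete with a minimal sketch. It assumes that options are considered in a fixed order and that each carries a single expected-satisfaction score; the function names, the job labels, and the scores are all hypothetical.

```python
def maximize(options):
    """Return the option with the highest expected satisfaction."""
    return max(options, key=lambda o: o[1])


def satisfice(options, minimum):
    """Return the first option that meets the minimum level, else None."""
    for option in options:
        if option[1] >= minimum:
            return option
    return None


# Hypothetical (label, expected satisfaction) pairs, in the order considered.
jobs = [("clerk", 4), ("sales", 6), ("design", 9)]
print(maximize(jobs))      # ('design', 9)
print(satisfice(jobs, 5))  # ('sales', 6)
```

Given identical expectations, the two rules select different actions, so a prediction cannot be derived without first knowing which rule the individual follows.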

But we are not through with the problems of hedonism yet. Since any adequate 
theory of motivation has to explain thoughts as well as overt actions (since 
thoughts are a type of action and therefore must be caused, i.e. motivated), 
hedonism would lead to the prediction that people should do everything in their 
power to avoid unpleasant thoughts, i.e. thoughts which cause them anxiety, 
dissatisfaction, self-doubt, guilt, depression, etc. 

A policy of consistently avoiding unpleasant thoughts through extensive use of 
defense mechanisms and the like would lead to mental illness, however. One 
requirement of psychological health is a policy of not resorting to defensive 
mechanisms in order to avoid disturbing thoughts (and, as a corollary, not letting 
pleasurable thoughts, e.g. wishes, substitute for one’s rational judgment).

A final critique of hedonism is that the doctrine is by its nature deterministic, i.e.
it is a form of psychological determinism. Hedonism, in arguing that men inevitably,
by their nature, base their choices and actions on calculations of expected pleasure
and pain, implies that they cannot act otherwise. The doctrine in effect asserts that 
men are powerless to resist the urges of their feelings and emotions. They are 
assumed to be helplessly pushed and pulled by their desires and fears, their 
ultimate choices in a given situation being the resultant of the total forces (i.e. 
feelings) acting on them at the time. To the degree that rational thought is 
employed, it is only in the service of one’s feelings.

The assumption of determinism wipes out the possibility of ethics (the science 
that identifies the principles that should guide men’s actions), since ethics presup- 
poses choice (78). This doctrine also wipes out the possibility of knowledge, since it 
implies that men can only believe what they were forced to believe by forces 
beyond their control. According to the theory of determinism, for example, the 
only reason people advocate the doctrine of hedonism is that they feel good when 
they think about it and cannot, therefore, resist endorsing it (50)! 

Determinism ignores the fact that man has a conceptual form of cognition 
which allows him to identify his feelings and the values which give rise to them, 
and to decide whether or not to act on them (14). Man’s rational faculty enables 
him to choose the standards which he will use to guide his choices and actions (78). 

A final note with respect to the assumption of hedonism: it is interesting to 
observe that virtually none of the empirical studies designed to test expectancy 
theory have used measures of expected pleasure or expected satisfaction as their 
index of valence. Usually individual subjects or employees are asked to indicate 
their relative preferences for or the relative desirability or importance of various 
outcomes or actions. Such procedures, of course, avoid the assumption that an 
individual’s choices are based on the relative degree of pleasure they will bring or 
the pain they will avoid. The consistency with which directly hedonistic measure- 
ments are avoided in these studies suggests the possibility that advocates of 
expectancy theory do not themselves fully accept its hedonistic assumptions. 

One recent study used a measure of past pleasure experienced when carrying 
out various job activities in addition to the usual valence measures (stressing 
desirability) and found that the former correlated with actual performance better 
than the combined V-I-E measures (98). However, this study did not test the 
possibility that such past pleasure was the result of past effective performance (and 
associated rewards), in which case the correlation would be a spurious result of 
past performance correlating with present performance. 

If individuals cannot properly be described as pursuing pleasure, then what 
does guide their actions? With respect to motivation, it would be more accurate to 
say that individuals strive to attain goals, values, or purposes than to say that they 
strive for pleasure. Even when pleasure is a causal factor in choice, an individual’s 
focus in acting is typically on the object of the action (the goal) rather than on 
pleasure (2). If an individual’s primary focus were on pleasure, he would probably 
not attain it, since it cannot be attained directly, only indirectly, i.e. by attaining 
goals and objects (2). 

Describing human behavior (qua human) as goal-directed or purposive does 
not make the a priori assumption that pleasure and pain are the sole and necessary 
determinants of action. 


TIME SPAN A capacity which distinguishes man from the lower animals, and one
which stems from his ability to conceptualize, is the capacity to project the long 
range consequences of his actions. However, since the use of one’s conceptual 
faculty is volitional (since it is exercised by choice), men differ enormously in the 
consistency and persistence with which they project the future. In addition, men 
differ in the ability and accuracy with which they make such projections (de- 
pending upon their experience, knowledge, intelligence, and methods of think- 
ing). Since the possible time span across which men may project their actions and 
their consequences ranges from zero to the range of a lifetime, one can know 
nothing about how a man will act without knowing what time span the individual 
is considering when making his choices. 

Despite observable differences among individuals in the above characteristic, 
expectancy theory has nothing whatever to say about the subject. Evidently there 
is an implicit premise to the effect that all individuals are alike in this respect. 


THE SEARCH-LOAD PROBLEM Just as individuals differ widely in the time span 
across which they project their actions, they differ widely in the number and types 
of actions and consequences which they consider when making their decisions (11, 
12). This is partly an issue of necessity, partly one of style, and partly one of 
motivation. The necessity pertains to differences in mental capacity (cognitive 
load capacity) and differences in knowledge (of the various possible alternatives 
and action outcomes). The style issue pertains to differences in methods of think- 
ing (defense mechanisms; use of reason vs emotion; use of one’s own judgment vs 
conformity to others, etc). The motivation issue involves differences in persistence, 
self-confidence, and what the individual considers important. 

Without knowing what a man will consider when forecasting his actions (e.g. 
how he will go about searching for consequences; when he will stop searching; 
how he will integrate the information obtained), one can predict little or nothing 
about what actions he will take. Again, however, expectancy theory has nothing to 
say about this issue, the implication being that people are basically the same with 
respect to search habits and load capacity. 

This assumption is exemplified in the typical research study in which the 
experimenter defines for the individual the precise outcomes and behaviors he is 
to consider when forecasting his actions, and even makes the subsequent mathe- 
matical V-I-E calculations for the subject—calculations which he could not make 
consciously [and may not actually make at all; see (11)]. 


SUBCONSCIOUS MOTIVATION The previous point brings up an additional problem 
with expectancy theory, namely, the implicit assumption that all motivation is 
conscious. Expectancy theory assumes that the individual consciously calculates 
the expected pleasure and/or pain to be gained or avoided through various actions 
and makes his choice(s) accordingly. 

One does not have to be a confirmed advocate of Freudian psychoanalysis to 
recognize that individuals are not always conscious of their motives, premises, 
values, expectancies, and the like. If an individual is not conscious of all his values 
and premises, he cannot, of course, make conscious calculations regarding how to 
maximize his pleasure and minimize his pain (even granting that these are his 
ultimate goals). 

Just as individuals differ in the time span across which they project their actions 
and the number and type of alternatives they consider before acting, they differ in 
the degree to which they are aware of the contents and processes of their own 
mind. Like being aware of the external world, being aware of one’s own psychol- 
ogy depends on the desire and the ability to use one’s conceptual faculty (14). 

Again, expectancy theory has nothing to say about subconscious motivation; 
thus it must assume implicitly that subconscious motivation either does not exist or 
is not important. 


IMPULSIVE, EXPRESSIVE, NEUROTIC, AND HABITUAL BEHAVIOR There are certain 
categories of action which even the most confirmed V-I-E advocate could not 
claim were taken calculatively. The main feature of impulsive behavior, for 
example, is that it is done without calculation (hedonistic or otherwise). For 
example, a worker gets angry at his foreman and suddenly, during an argument, 
punches him in the nose. Afterwards he may be very sorry, especially if he is fired 
and subjected to a lawsuit. His explanation will usually be: “I just hit him. I didn’t 
think about what I was doing.” Such action is impulsive rather than instrumental. 
The time span considered in projecting the consequences of such actions is zero. 
The individual is not consciously aiming at a goal, but simply expressing a feeling. 
The same can occur with positive emotions, e.g. running and jumping with joy 
over some happy event. 

There are also actions which the individual recognizes as harmful to his own 
interests, but which he takes anyway because he cannot resist the impulse, as in the 
case of genuine compulsive behavior. 

Neurotically rigid behavior is another example of behavior which does not 
respond consistently or predictably to environmental circumstances (11). 

Habitual behavior is also done without extensive advance calculation, although 
such calculations may have occurred when the habit was first acquired. 

Since the types of behavior described above are not at all uncommon, the failure 
to account for them constitutes another limitation of expectancy theory. 


THE RATIO SCALE PROBLEM It has been pointed out that the formulas involved in 
the basic expectancy theory postulates assume the existence of ratio scales, since 
multiplication of valences, expectancies, and instrumentalities is involved. 
However, there is as yet no known method of measuring values or valences on a 
ratio scale (or even a true interval scale). Thus the form of the theory assumes the 
existence of measurements which do not exist (83). If such measurements do not 
exist, it is difficult to see how individuals, in choosing among action and outcome 
alternatives, could act as if they do. Whatever individuals use to make their 
calculations, it cannot be a ratio scale for valences. 
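
The force of the ratio scale objection can be illustrated with a minimal sketch. It assumes, purely for illustration, that each action is scored as a sum of expectancy times valence products and that valences are known only up to the choice of unit and zero point. The function names and all numbers are hypothetical and are not drawn from any study cited here.

```python
# Hypothetical illustration: every number below is invented.

def action_score(expectancies, valences):
    """Sum of expectancy x valence products for one action."""
    return sum(e * v for e, v in zip(expectancies, valences))


def preferred_action(actions, scale=1.0, shift=0.0):
    """Name of the highest-scoring action after every valence v
    is re-expressed as scale * v + shift."""
    scores = {
        name: action_score(es, [scale * v + shift for v in vs])
        for name, (es, vs) in actions.items()
    }
    return max(scores, key=scores.get)


# Each action maps to (expectancies, valences) for its two outcomes.
ACTIONS = {
    "A": ([0.9, 0.8], [1.0, 1.0]),
    "B": ([0.5, 0.1], [4.0, 0.0]),
}

print(preferred_action(ACTIONS))              # original valence numbers
print(preferred_action(ACTIONS, scale=3.0))   # new unit only
print(preferred_action(ACTIONS, shift=10.0))  # new zero point
```

With these invented numbers the first two calls select action B and the third selects action A. A change of unit leaves the predicted choice intact, but a change of zero point, which is a legitimate transformation of anything short of a ratio scale, reverses it.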


THE INFINITE REGRESS PROBLEM Expectancy theorists argue that the valence of 
an outcome is the product of the valences of all other outcomes to which this 
outcome leads and the instrumentality for that outcome in producing these other 
outcomes. Thus (theoretically) each particular valence is explained on the basis of 
other valences. Taken literally, this leads to an infinite regress, since each valence 
would have to be calculated by associating it with other valences and so on ad 
infinitum. In practice, individuals could not possibly make such calculations when 
deciding upon a course of action. The cognitive load would be overwhelming. 
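
The regress can be sketched in a few lines of code. The sketch assumes one common reading of the postulate, namely that the valence of an outcome is the sum of instrumentality times valence products over the outcomes to which it leads. The outcome names, the instrumentalities, and the `anchors` argument (valences taken as given rather than derived) are hypothetical devices introduced only for this illustration.

```python
def valence(outcome, links, anchors, depth=0, max_depth=50):
    """Valence of `outcome`, derived from the valences of the outcomes it leads to.

    links   -- maps an outcome to {further outcome: instrumentality}
    anchors -- valences that are taken as given instead of being derived
    """
    if outcome in anchors:
        return anchors[outcome]
    if depth >= max_depth:
        # Nothing ever stops the chain: each valence waits on another one.
        raise RecursionError("no anchoring valence reached for " + repr(outcome))
    return sum(
        instrumentality * valence(further, links, anchors, depth + 1, max_depth)
        for further, instrumentality in links[outcome].items()
    )


# Invented chain of outcomes in which every valence depends on another.
LINKS = {
    "high output": {"pay raise": 0.8},
    "pay raise": {"status": 0.5},
    "status": {"high output": 0.6},
}

# With one valence taken as given, the calculation terminates.
print(valence("high output", LINKS, {"status": 10.0}))

# With every valence derived from other valences, it never does.
try:
    valence("high output", LINKS, {})
except RecursionError as err:
    print("regress:", err)
```

The calculation terminates only when some valence is supplied from outside the formula, which is the point at issue: the formula by itself does not say where such a starting value comes from.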

This problem could be resolved by recognizing that valences (i.e. feelings) are 
not psychological primaries. Feelings (and expected feelings) are the consequences 
of (subconscious) value judgments (52, 56). The sum of one’s value 
judgments toward an object or outcome is automatically integrated by the 
subconscious and is reflected in one’s emotional reactions to that object or 
outcome. To experience an emotion, the individual does not have to consciously 
focus on everything that is associated with a given object or outcome. His 
subconscious will make these associations for him.* However, it does not follow that 
by knowing an individual’s (automatic) emotional reaction to an object one can 
predict his actions toward it—since this assumes the validity of the doctrine of 
hedonism, which has been previously questioned. 

Some of the above criticisms of expectancy theory point to errors of omission in 
the theory (e.g. time span, cognitive load). That is, they are issues about which the 
theory says nothing explicitly but about which something could be said, e.g. by 
expanding the theory. 

Other criticisms, however, entail errors of commission, i.e. problems regarding 
the validity of what was actually said. Some of these, such as hedonism, are 
fundamental to the entire approach of the theory (if not to the practice of 
researchers). Others, such as the impulsive behavior problem, point to broad areas 
where the theory simply does not apply. Others might be resolved by making 
substantial revisions in the theory, e.g. the subconscious motivation problem, the 
ratio scale problem, the infinite regress problem. 

If the foregoing critiques are accepted and used as a basis for revising the theory, 
however, it is questionable how closely the revised version will resemble the theory 
in its present form. 

An alternative to the hypothetico-deductive approach to motivation of expec- 
tancy theorists would be a more inductive approach designed to discover how 
employees actually make choices among action alternatives. A theory of employee 
motivation could eventually be constructed on the basis of these findings. Such a 
theory, of course, would be highly complex and would require decades to develop. 


GOAL-SETTING AND MANAGEMENT BY OBJECTIVES 


Goal-setting is more appropriately viewed as a motivational technique rather than 
as a formal theory of motivation, despite previous suggestions to the contrary (51). 
Historically, the study of goal-setting stems from two sources: one industrial and 
one academic. Taylor used assigned goals or “tasks” as one of the key techniques 
of his system of Scientific Management (96). Each workman was given a challenging 
but reachable work goal each day based on the results of time (and 
motion) study. The methods by which he was to reach the goal (e.g. tools to be 
used, work techniques, distribution of rest pauses, etc) were spelled out in detail. 

* It must be stressed that to experience an emotion is not necessarily to be aware that one is 
experiencing it. To know that one is experiencing an emotion, and more importantly to 
know what emotion one is experiencing, requires an act of conceptual identification. 

A modern version of the task idea can be found in the system called Management 
by Objectives (MBO). MBO is basically a method of developing a hierarchically 
integrated network of goals throughout all managerial levels of an 
organization (18). 

In the academic realm, concepts like “task,” “set,” and “intention” were used by 
psychologists in the Würzburg school to explain the results of their laboratory 
experiments. During the 1930s American psychologists in this tradition performed 
extensive studies of “level of aspiration” (quantitative goal setting), but they were 
more interested in the causes of aspiration level than in its effects. 

Recent studies of goal-setting and its effects have been summarized by Locke 
(51) and Ryan (82). Pritchard & Curtis (74) conducted one of the few laboratory 
studies of goal-setting since Miner & Dachler’s review 2 years ago. They replicated 
Locke’s finding that moderately hard goals lead to higher performance levels than 
no (i.e. low) goals. However, they did not support his finding that monetary 
incentives affect performance only through their effects on goal level or goal 
commitment. Subjects offered a high (piece-rate) incentive for performance 
showed higher output than subjects given no piece rate incentive or lower incen- 
tives, even though the goal levels of the various groups were apparently equiva- 
lent. Nor were differences found in the degree of conscious goal commitment 
expressed by subjects in the various incentive conditions. Possibly subconscious 
commitment differences could account for the results. On the other hand, differ- 
ence scores rather than residual gain scores were used as the measure of output, 
which could have confounded the results. 
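
The distinction between the two kinds of score can be shown with a brief sketch. It assumes the usual definitions: a difference score is the later output minus the earlier output, and a residual gain score is the later output minus the value predicted from the earlier output by least-squares regression. The data are invented and do not come from the study discussed above.

```python
from statistics import mean


def difference_scores(pre, post):
    """Later output minus earlier output, subject by subject."""
    return [b - a for a, b in zip(pre, post)]


def residual_gain_scores(pre, post):
    """Later output minus the value predicted from earlier output
    by a least-squares regression line."""
    mean_pre, mean_post = mean(pre), mean(post)
    sxx = sum((x - mean_pre) ** 2 for x in pre)
    sxy = sum((x - mean_pre) * (y - mean_post) for x, y in zip(pre, post))
    slope = sxy / sxx
    intercept = mean_post - slope * mean_pre
    return [y - (intercept + slope * x) for x, y in zip(pre, post)]


# Invented outputs for five hypothetical subjects.
PRE = [10, 12, 15, 18, 20]
POST = [14, 15, 17, 19, 20]

print(difference_scores(PRE, POST))
print(residual_gain_scores(PRE, POST))
```

In this made-up example the difference scores shrink steadily as initial output rises, so any comparison of groups that happened to differ in initial output would be biased. The residual gain scores are uncorrelated with initial output by construction, which is why they are often preferred as a measure of change.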

Laboratory research on goal-setting has been criticized on the grounds that its 
findings have not been replicated in real-life settings. However, four recent studies 
by Latham and his colleagues indicate that these laboratory findings can be 
successfully replicated in and applied to actual industrial situations. Ronan, 
Latham & Kinne (80) found in two studies that high productivity among independent 
pulpwood producers was correlated with the presence of a supervisor 
who stays on the job with his men and who sets production goals for them. 

In an experimental study, Latham & Kinne (47) found that overall crew output 
per man was higher over a 12 week period in crews where a specific, moderately 
difficult goal was assigned to the sawyers in each crew than in control crews in 
which the sawyers were assigned no such goals. 

The fourth study (46a) was also experimental but used a time series design. (A 
control group, unfortunately, was not available). Logging operators were assigned 
the goal of loading their trucks to 94% of maximum legal net weight; the previous 
average attained had been close to 60% of net weight. Latham found that perfor- 
mance improved immediately upon the introduction of the goals and remained at 
a high level throughout the following 9 months of the study. For the final 6 
months, the percent of maximum load attained averaged over 90%. The net 
savings to the company of this increase in performance, as compared with what it 
would have cost to attain it through the purchase of additional trucks, was a 
quarter of a million dollars! 

One surprising fact about the above studies was that goal assignments were 
made and performance increases obtained without the offering of any extrinsic 
reward for goal acceptance or goal attainment, and without any overt punishments 
being threatened for nonattainment. Obviously such dramatic results could not be 
expected in all organizations. 

In another industrial study, Dachler & Mobley (20) found significant correlations 
between employees’ self-chosen production goals and subsequent performance 
in two different plants, although the correlations were not large. 

A number of recent studies of MBO have been summarized and integrated by 
Carroll & Tosi (18). These studies emphasize “soft” criteria, i.e. attitudes towards 
MBO and ratings of performance by superiors, rather than “hard” measures of 
actual organizational or individual effectiveness. 

A number of interesting correlational findings emerged from these studies. For 
example, high goal clarity was consistently related to high satisfaction and high 
rated performance. High goal difficulty was associated with high performance 
only when the subordinates were mature and self-assured. Otherwise, hard goals 
tended to produce lowered performance. Presumably the hard goals demoralized 
and/or were rejected by employees with low self-confidence. Providing frequent 
feedback or performance reviews was virtually always associated with positive 
changes in attitudes and performance. 

Degree of participation in goal-setting had no consistent relation to subsequent 
performance. Generally employees performed better with participation if they 
were used to it, but not otherwise. Superiors were usually most willing to allow 
participation by those subordinates whom they considered to be most competent. 

Carroll & Tosi emphasize repeatedly that upper management commitment to 
MBO is crucial to its success, as is the integration of the MBO process with other 
organizational systems (e.g. the reward structure, budgeting, appraisal, etc). 

One element of MBO that has not received sufficient emphasis in the research 
literature is the development of action plans, i.e. identifying the means by which 
the goal(s) will be achieved. Nor has this element received much attention in 
laboratory studies of goal-setting because their main emphasis has been on di- 
rectly inciting the individual to greater effort rather than on developing strategies 
for goal attainment. However, in real life settings, increased effort is not always the 
most efficient method (or even a possible method) of attaining a new or harder 
goal. The most successful goal achiever is often the individual who devises the 
most effective or creative tactics and strategies for goal attainment or who most 
thoroughly develops the prerequisite skills in himself or his subordinates. More 
studies of individual differences in strategies or methods of arriving at or choosing 
strategies and of their relative effectiveness would be welcome. 

One final issue concerns the relationship between “goal theory” and expectancy 
theory. Expectancy theory, as originally developed (111), never incorporated an 
explicit goal-setting stage, although some writers have suggested that one be 
added to the model (17, 20). One interesting difference in research findings 
between the two approaches is that expectancy theorists typically find a positive 
correlation between expectancy and performance, while goal-setting studies find 
that harder goals (with lower expectancies) lead to higher performance than easier 
goals [with higher expectancies (51)]. 

A recent study by Cartledge (19) attempted to reconcile some of the seemingly 
contradictory findings by positing that expectancy theory (i.e. the V, I, and E 
components) should be used to predict goal choice rather than to predict perfor- 
mance directly. In an experimental laboratory study Cartledge found a positive 
association between both E and V and the acceptance of an assigned goal. 
[Dachler & Mobley (20) also found that V-I-E theory predicted goal choice.] 
However, when all other factors were controlled, there was no significant relation 
between these factors and a measure of task effort (performance quantity, with 
ability partialed out). The best predictor of effort was the individual’s intended 
work rate; the next best was the individual’s perceived goal difficulty (measured by 
having the individual indicate how fast he would have to work to achieve his 
assigned goal if he tried for it). Both relationships held up when expectancy of 
success was partialled out. 

A positive association was found between expectancy of success and effort in 
one subgroup, however. This subgroup was composed of individuals whose sub- 
jective expectancies and “objective” probabilities of success (based on the diffi- 
culty of their assigned goal) were widely discrepant. Those whose expectancy 
estimates were “too high” had significantly more ability on the task than those 
whose estimates were “too low.” This indicates that the positive expectancy- 
performance correlations obtained in some previous studies might have been 
an artifact of differences in ability, since expectancy is presumably correlated with 
ability (72). 
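
The artifact argument can be made concrete with the standard formula for a first-order partial correlation, which gives the correlation between two variables with a third held constant. The sketch below is only an illustration: the three input correlations are invented, and the variable names are labels chosen here, not measures from any of the studies cited.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y with z held constant (first-order partial)."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Invented correlations for a hypothetical sample.
r_expectancy_performance = 0.40
r_expectancy_ability = 0.80
r_performance_ability = 0.50

print(partial_correlation(r_expectancy_performance,
                          r_expectancy_ability,
                          r_performance_ability))
```

With these invented values the partial correlation is essentially zero. A moderate raw correlation between expectancy and performance can therefore vanish once ability is held constant, if ability is related strongly enough to both.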

Cartledge’s results indicate that in cases where the individual makes a definite 
commitment to a specific goal, performance (as measured by effort) is propor- 
tional to the demands of the task rather than to expectancy. 

In settings where the individual is not required to commit himself to a specific 
goal, one might expect somewhat different results, depending upon the goal level 
originally chosen and the changes in goal level that occur as a result of experience 
with the task. 


THE HERZBERG THEORY AND JOB ENRICHMENT 


The Herzberg theory (39) continues to draw the attention of researchers, and the 
results continue to throw doubt upon key aspects of the theory, e.g. the view that 
“Motivators” cause job satisfaction but not dissatisfaction, while “Hygienes” 
cause job dissatisfaction but not satisfaction (32, 37, 55-57, 94, 99, 104, 105). Only 
when Herzberg’s basic methodology, including his classification scheme, is used is 
the theory consistently supported (56, 59). 

Waters and his colleagues (104, 105) find that attitudes toward Motivators are 
more highly correlated with both job satisfaction and job dissatisfaction than are 
attitudes toward Hygienes. This finding replicates the results of several earlier 
studies using similar correlational methods, as well as those of studies using more 
direct rating methods (94). 

The results using an “intensity” approach to determine the potency of Motiva- 
tors and Hygienes in causing satisfaction and dissatisfaction, however, do not 
support the results using a frequency approach. Generally the former method 
finds that both Motivators and Hygienes are reported to cause more satisfaction 
than dissatisfaction (32, 56). 

This writer has pointed out (36) that the Herzberg incident classification system 
confuses two levels of analysis, viz events (what event or condition occurred or was 
present) and agents (who or what caused the event to occur or brought about the 
condition). When employees are asked to describe satisfying and dissatisfying 
occasions, and the incidents are coded separately by event and agent, the Herzberg 
two-factor theory is not replicated. The consistent finding with respect to events is 
that Motivator events are more likely to produce both satisfaction and dissatis- 
faction than are Hygiene events (37, 55-57). In this respect the results agree with 
the typical result of the correlational studies mentioned above. 

With respect to agents, the results are more complex. There is a tendency, 
possibly defensive, for the self to be given credit for satisfying events and for others 
(supervisors, co-workers, the organization, etc) to be blamed for dissatisfying 
events (57). Overall, however, the supervisor and the organization are seen as by 
far the most important causal agents for both satisfying and dissatisfying events 
(57). 

On the issue of defensiveness, Wall (99) found that the tendency to list Hygienes 
as sources of dissatisfaction was significantly correlated with a measure of ego- 
defensiveness. Since many of Herzberg’s Hygienes are agents (e.g. supervisor, 
organizational policies, co-workers), this supports the notion that his original 
results are at least partly an artifact of defensive tendencies within individuals. 

Wall’s findings also point up another aspect of the Herzberg controversy, that of 
individual differences. A number of researchers have observed that, at least with 
respect to verbal reports of job attitudes, all individuals do not react equally 
favorably to Motivators, e.g. to opportunities for growth, autonomy, job enrich- 
ment, etc (41). 

The evidence suggests that there is a tendency for blue collar workers, especially 
those with urban, industrial backgrounds and those who choose to work in large 
firms to be more likely to report Hygienes and less likely to report Motivators as 
sources of satisfaction (37, 41, 42, 55, 57). Other studies, however, have questioned 
the validity of some of these conclusions (95). At root, of course, such differences 
must be an individual matter rather than a matter of demographics or occupa- 
tional level, since there are substantial individual differences among employees 
within all of the above groups. One individual trait claimed to be causally related 
to satisfaction with enriched jobs is cognitive complexity (93). Another is “higher 
order need strength” (48, 102). 

It should be noted, however, that what people say they get pleasure from on 
their jobs is partly a function of the type of experiences they have had; therefore 
individual differences in such descriptions are not necessarily evidence for basic 
underlying value differences among employees (57). The fact that people do not 
claim to gain satisfaction from Motivators does not mean that they will not 
respond to them when they are provided. Some people do not have enough 
experience to predict accurately how they will respond to an entirely new job 
situation. Furthermore, the degree of satisfaction they experience with a given 
condition may have no relation to its effect on their job performance, since the two 
are not necessarily related (56). 

What would be interesting in this regard is an experimental study of the “hard” 
(performance) and “soft” (attitudinal) reactions of different types of employees 
(e.g. those who claim to value and those who claim not to value jobs with 
opportunities for cognitive growth) to job enrichment. 

The idea of job enrichment, though not originally developed by Herzberg, was 
popularized by him and has stimulated a great deal of applied research. Even 
though Herzberg’s theory appears mistaken, mainly with reference to the role of 
Motivators as dissatisfiers and the role of Hygienes as satisfiers, this has not led to 
severe problems in application, for the main thrust of job enrichment is to provide 
more Motivators as a means of increasing morale and motivation. 

The evidence from field studies is generally quite favorable to job enrichment 
with respect to improvements in both motivation and morale (27, 60, 89). Ford 
observed that about 85% of employees respond favorably to enrichment at AT & T 
(27). Practitioners report that it is very difficult to predict who will respond most 
favorably in advance. The most troublesome employees sometimes respond quite 
favorably, suggesting that this troublesomeness was an expression of frustration 
due to unused mental capacity (60). 

Experienced consultants in the job enrichment area warn against the danger of 
job enrichment becoming a fad rather than one solution to one type of problem, 
i.e. underutilization of employees’ skills and capacities, especially mental 
capacities (89, 92). Sirota & Wolfson have made a useful contribution to the field by 
pointing out the various obstacles to job enrichment which can be encountered in 
the course of implementation (e.g. wrong preconceptions of managers, poor 
diagnosis, technological constraints, etc) and by offering advice on how to overcome 
these obstacles (90, 91). 

At the theoretical level, there is a need to begin identifying the particular 
elements of job enrichment and to identify their relative degree of effectiveness in 
producing higher morale and productivity. For example, the following types of 
changes have been made by practitioners under the guise of “job enrichment” (e.g. 
see 27, 31, 60, 89): 


— increased responsibility (taken from supervisor); 
— increased autonomy (elimination of rules which limit discretion); 
— increased recognition for job accomplishment; 
— development of work modules or “wholes”; 
— provision of objective performance feedback regarding quality and/or 
  quantity; 
— job rotation (greater variety of tasks at the same level); 
— upgrading of employee skills through training; 
— goal-setting; 
— use of new tools, procedures, and equipment; 
— development of cohesive work teams; 
— increased participation in decision-making; 
— pay raises; 
— reorganization of the work task to improve “physical” efficiency (from a 
  time-study viewpoint). 


It is clear from the above that a great many things are done under the rubric of 
job enrichment; yet the relationship of many of these changes to the actual theory 
of enrichment (e.g. Herzberg’s theory) is far from clear. It was mentioned above 
that job enrichment allegedly involves the provision of Motivators in order to 
increase satisfaction and performance. However, pay raises, participation, and the 
development of work teams are Hygiene factors not Motivator factors. Goal- 
setting, according to Ford, is not the way to enrich jobs (27, p. 28), even though 
considerable research indicates that goal achievement is a major source of job 
satisfaction. Finally, feedback and improvements in tools, equipment, and in 
general physical efficiency stem more from the theory of scientific management 
(96) than from job enrichment theory. 

The fact that what people do under the guise of job enrichment usually seems to 
“work” is encouraging from an applied standpoint; but from a theoretical view- 
point (and from a long-range applied viewpoint) it would be useful to know why 
these changes work. Studying job enrichment in terms of its various elements is 
extremely difficult, especially in field settings, but studies of this type might prove 
very fruitful in explaining the results of job enrichment studies and in providing a 
basis for more complete theories of employee motivation. Such theories, of course, 
would subsequently aid practitioners. 

Large scale experimental studies are not the only method by which this could be 
accomplished. Intensive studies of one or two groups, changing various work 
attributes in sequence, might provide useful information despite the drawbacks of 
such time series designs. 


PAY AND MONETARY INCENTIVES 


Despite decades of ritualistic attacks on the concept of “economic man,” and 
despite Taylor’s somewhat oversimplified assumptions about worker motivation 
(96), employees continue to want compensation in return for their work and 
researchers continue to find it a stimulating research area (97). No single theore- 
tical viewpoint dominates thinking on this subject. Rather one could say that 
thinking about pay permeates all theoretical approaches to employee motivation. 
For example, Herzberg’s theory stresses its role as a Hygiene factor (39), al- 
though research indicates that it can serve both as a motivator of job performance 
(48) and as a source of job satisfaction (37, 57). Beer & Gery (10) argue that this 
depends on the type of pay system. Merit systems may be perceived as instru- 
mental in satisfying “higher order needs,” while security-type payment systems 
are seen as satisfying “lower order needs.” 

Job enrichment specialists argue that enrichment must be accompanied by pay 
raises to avoid feelings of inequity (88). There is ample evidence for the view that 
perceived equity is an important determinant of pay satisfaction (26, 46, 76), even 
though there is as yet no adequate theory which would account for the different 
ways in which equity is conceived by different employees (106). 

Expectancy theorists have focused on the relationship between performance 
and the degree to which high output is seen as instrumental in leading to higher 
pay (20, 44, 84, 87). Other conditions facilitating the effectiveness of a pay incentive 
plan include: the absence of group norms opposing high production; high 
valuation of pay by the employees; ability of the employees to understand the 
incentive plan; and a feeling on the part of the employees of support by and trust 
of upper management (16). 

Another recent study supporting the expectancy theory viewpoint found that 
productivity was higher when pay was based on individual as well as group output 
than when it was based on group output alone (108). 

On the opposite side of the coin, there is some evidence (from laboratory 
studies) that when money is made contingent upon performance, it reduces the 
individual’s intrinsic motivation, whereas if money is noncontingent, it does not 
reduce intrinsic motivation (23). This would lead one to predict that employees 
under merit systems would be less intrinsically motivated than those employed 
under a civil service or seniority system, a prediction which does not accord well 
with the common-sense view of nonmerit systems as promoting inefficiency. Ac- 
tually, the above studies only compared piece rate vs noncontingent hourly pay. 
There was no intermediate condition in which there was hourly pay, but where 
promotion and retention depended upon performance. 

Advocates of the “human relations” school of employee motivation favor group 
incentive plans like the Scanlon plan in order to encourage teamwork and prevent 
inter- and/or intragroup competition. Research on the Scanlon plan indicates that 
management attitudes toward employees and management's degree of commit- 
ment to a participative style of management are among the most important 
elements contributing to its success or failure in a given organization (81, 110). 
Expectancy theorists would question the degree to which a company-wide incen- 
tive would aid performance because of the limited degree to which any single 
individual’s effort could affect company performance (108). However, it has not 
been proven that the incentive aspect of the plan is responsible for its success. A 
key element of the Scanlon plan is the development of better communication 
between management and labor through group meetings combined with a sug- 
gestion plan. Possibly the latter elements of the Scanlon plan are actually respon- 
sible for its success in those organizations where it has worked. 

Advocates of Management by Objectives also stress the importance of pay in 
facilitating the success of this approach. It is asserted that MBO must be integrated 
with the organization’s appraisal and reward systems in order for the managerial 
personnel to be committed to the goal-setting approach and to attaining the goals 
which they set (18). Dearden (22) has pointed out the complexities and pitfalls 
involved in the installation of any type of incentive plan. 


JOB SATISFACTION AND JOB PERFORMANCE 


Recent studies have further confirmed a host of earlier studies (56) to the effect 
that job satisfaction exerts little or no causal influence on productivity. When the 
two are related at all, the evidence favors interpreting productivity 
as a cause of satisfaction (34, 101) rather than vice versa. It has been argued that 
productivity and satisfaction will be related in this fashion to the extent that there 
is a relationship between productivity and the attainment of the individual’s 
important work values and goals (53). Nathanson & Becker (69) found some 
support for this proposition in a study of physicians. In circumstances where it 
could be assumed that physicians derived intrinsic satisfaction from outpatient 
care activities, or that they received recognition for such activities, or that outpa- 
tient care was an end in itself rather than a means to an end, satisfaction and 
productivity were significantly correlated. When these circumstances did not hold, 
the two variables were unrelated. 

Recent findings concerning the effect of job satisfaction on withdrawal from the 
job also confirmed earlier results. Porter & Steers (73) carefully reviewed over 60 
studies from the last 15 years and found substantial support for a relation between 
satisfaction and both turnover and absenteeism. A number of longitudinal and 
field-experimental studies supported the results obtained with the more common 
concurrent or retrospective studies. 

Studies published since Porter & Steers’ review further support their results, 
regardless of whether job attraction is measured with the Cornell JDI (103), 
expectancy theory measures combining the V-I-E components (24), or Motivator-Hygiene 
deprivation (7, 45). However, Porter & Steers note that the best 
individual predictor of termination may be the individual’s intention to terminate. 
[Others have found frequency of “thoughts about quitting” to work well (7).] 
Presumably the intent to leave takes account of the individual’s attitude toward his 
job in addition to other factors which could affect his decision, e.g. economic need, 
availability of other jobs, family considerations, attractiveness of fringe benefits on 
present job, etc. 

The specific variables which Porter & Steers found to be associated with a low 
degree of withdrawal included: pay equity; opportunities for promotion; consid- 
erate supervisory style; small group size; characteristics of the work itself such as 
responsibility, autonomy, variety, and low stress; age; long previous tenure; and 
interests compatible with present work requirements. Porter & Steers hypothe- 
sized that the highest probabilities of withdrawal are shown by two types of 
employees: those who are anxious and emotionally unstable; and those who are 
highly ambitious, self-confident, independent, and aggressive. Presumably the 
first type is easily threatened and impulsive about acting on his negative feelings, 
while the second type is easily frustrated if deprived of opportunities commensurate 
with his aspirations (e.g. see 39a). 

Another interesting finding in the recent literature is that there is often a 
continuum of withdrawal from absenteeism to turnover with high (or increased) 
absenteeism preceding a final decision to terminate (15, 58). 

Since it is well accepted that satisfaction is a function in some way of the fit 
between individual values and job conditions, this suggests that there are two 
possible ways of increasing satisfaction: changing the person and changing the 
job. There is some evidence that giving the individual a realistic job preview can 
reduce disappointment and increase attraction (100). While such previews have 
been credited with helping the individual develop more realistic job expectations, 
this writer has argued elsewhere that expectations as such (divorced from aspira- 
tions, i.e. what the individual wants or values) will not have a major effect on 
attitudes (56). It was hypothesized that changing expectations would affect atti- 
tudes through their effect on the individual’s job aspirations (or more probably, his 
view of what he would consider to be minimally acceptable on a job). 

An organization need not change an individual’s values in order to insure that 
he will fit into the organization, however. It can cash in on existing differences 
between individuals by selecting those who have values and aspirations compat- 
ible with the organizational structure. To a considerable extent, of course, individ- 
uals select themselves into (and out of) organizations in this manner (30, 42). In 
addition, some value adaptation occurs as a result of experience in the organization 
(40). 

It is easier, of course, for an organization to modify job conditions than to modify 
individual values. The potential of the former approach (e.g. job enrichment, 
supervisory training, OD, incentives) is well known and needs no further discus- 
sion here (e.g. see 28). 

It has been argued elsewhere that the relationship between attitudes and with- 
drawal is now well enough established that more studies of this type will not add to 
our understanding of employee motivation and behavior (56). What is needed are 
studies involving: explanations for off-quadrant cases, i.e. those who are satisfied 
and withdraw, and those who are dissatisfied and do not withdraw; the cognitive 
processes leading to the decision to withdraw; and the study of qualitative as well 
as quantitative attitudinal factors as they relate to withdrawal (e.g. see 42). Porter 
& Steers (73) recommend additional types of studies. 

A recently studied variable which has been found to relate to withdrawal, 
satisfaction, and job performance is that of “role orientation,” defined by Graen, 
et al as “the perceived relevance of the job for the [employee’s] work career” (33, p. 
401). New nonacademic employees of a large university who saw their job as being 
related to work they planned to perform in the future (presumably as part of their 
career plans) were more satisfied, were rated as better performers by their supervisors, 
and were less likely to terminate than those who saw their jobs as unrelated 
to their career plans. An explanation for this finding remains to be found, but 
presumably it entails the individual’s willingness or desire to fully accept and to 
become involved in his work role. Further studies of role orientation seem warranted. 


MISCELLANEOUS 


Several interesting studies did not fall into any of the above categories. One was a 
longitudinal study of worker adjustment to the 4-day work week (71), apparently 
the only such study on this subject published to date. The effects of the change 
were generally favorable with respect to both job attitudes and reduced absenteeism. 
However, employees reported fewer favorable effects on home life after 
one year than after 6 or 13 weeks on the 4-day week. 

Another interesting study examined, through extensive interviews, the reactions 
of key managers from 90 small business firms to the flood damage wrought by 
hurricane Agnes in June 1972 (1). Degree of perceived threat was most highly (and 
negatively) related to high I scores on Rotter’s I-E scale. High I scores were also 
highly related to the number of task-oriented coping mechanisms used (e.g. trying 
to obtain loans). Those with high E scores, on the other hand, were more likely to 
employ coping mechanisms designed to reduce, control, or express dysfunctional 
emotions such as anxiety (e.g. withdrawal, aggression). While the number of 
different types of task-oriented or emotion-oriented coping mechanisms used was 
unrelated to coping effectiveness as rated by the interviewers, the total frequency 
with which task-relevant coping methods were employed was highly and posi- 
tively related to coping effectiveness. 

In this writer’s opinion, one of the most important books to be published in 
psychology in some time is Irving Janis’ Victims of Groupthink (43). Janis views 
groupthink as a product of pressures for conformity which arise within cohesive 
groups. He defines it as: “a mode of thinking that people engage in when they are 
deeply involved in a cohesive in-group, when the members’ strivings for unani- 
mity override their motivation to realistically appraise alternative courses of ac- 
tion ... Groupthink refers to a deterioration of mental efficiency, reality testing, 
and moral judgment that results from in-group pressures” (43, p. 9). Although the 
focus of Janis’ book is on the effects of groupthink on the quality of selected 
American foreign policy decisions, the implications of his ideas for organization 
management are obvious. Especially interesting are Janis’ suggestions for 
preventing groupthink. Advocates of the “human relations” approach to em- 
ployee morale and motivation may find this book particularly enlightening. 

Finally, recent studies by Miner lead him to forecast an alarming shortage of 
managerial manpower in the years ahead (63, 64). The reason for this, according to 
Miner, is a precipitous drop in the number of students with values compatible with 
the values required for managerial success. This will lead, argues Miner, to re- 
cruiting difficulties and/or to poor performance among those who are hired but 
who lack the requisite motivational structure. Miner proposes increased use of 
“role-motivation” training as the best solution to this problem (63), although it 
remains to be seen whether this will work in view of the wider cultural context in 
which these value changes are occurring, i.e. the philosophical assault on business 
and capitalism that is occurring in our colleges and universities (79). 
The 1972 review of personnel selection by Bray & Moses (31) focused upon the 
unfair test controversy generated by the consequences of the Tower Amendment 
to Title VII (Employment) of the Civil Rights Act of 1964 (48). In the 3-year 
period covered by the present review, 1971 through 1973, the extent of involve- 
ment of legal process in selection procedures and policies has broadened enor- 
mously. 

Despite the pressure, concern, interest, and effort invested, however, progress in 
both theory and practice in the development of selection procedures to meet these 
legal and the related technical challenges has been disappointingly limited. 
The three most notable developments in the triennium include: (a) the concurrent 
publication of four major summaries of testing, each from a different frame of 
reference, and all contributing to a new, integrated view of the field [Buros’ (35) 
Seventh Mental Measurements Yearbook, DuBois’ (67) History of Psychological 
Testing, Byham & Spitzer’s (37) study of The Law and Personnel Testing, and 
Holmen & Docter’s (96) report on Educational and Psychological Testing]; (b) a 
review, consolidation, and revision of the professional standards for testing (5, 6), 
and at the federal government level a proposed set of Uniform Guidelines on 
Employee Selection Procedures (165) which will supersede the EEOC Guidelines 
(92) and the OFCC Order on Testing (124); and (c) a broadening critique of the 
notion of differential validity and the “moderator variable” approach to the 
establishment of differential validity. 

Several minor developments are also worthy of note: alternatives to the objec- 
tive-type multiple-choice test such as job element examining, man-job matching, 
and similar techniques; an expanded interest in job analysis; and the beginnings 
of systematic inquiry into the procedures and problems of content validity. 

Assessment center technology, interviewing, and other “classical” techniques 
and topics continued to receive attention over the review period but without any 
notable developments. We do not seem to do better in 1973 than our ancestors did 
in 1917. 
This review is organized around five topics: 
1. THE STATE OF THE ART 
2. TESTING AND THE LAW 
3. METHODOLOGICAL PROBLEMS 
4. SPECIFIC SELECTION TECHNIQUES AND AREAS 
5. CONCLUSIONS 


THE STATE OF THE ART 


DuBois (67) traces the history of testing from its genesis in China circa 2200 B.C. as 
a method to select government officials, to its adoption four millennia later by the 
French in 1796, the English in 1833, and the United States in 1883. Over these 
millennia, tests were anchored in job content. Two inventions, however, established 
the pattern most characteristic of testing today: Binet’s use of a sample of generic 
human tasks to measure “intelligence,” and then the reduction of such items to the 
objective-type multiple-choice paper-and-pencil format. A simplistic view of the 
generic abilities required to perform successfully in any employment was married 
to an economical (cheap) methodology; job content was largely deleted from 
selection procedures and survived, if at all, on the criterion side of what emerged 
as the predictor-criterion relationship. It is at least arguable that in the more than 
half a century of experience with objective-type multiple-choice testing, little 
improvement has been achieved in this prediction, even in the face of ever more 
complicated statistical methodology. Ghiselli (83), summarizing all available 
studies from 1920 through 1966, noted an average validity of .19 for proficiency 
criteria and .29 for training criteria. Updating the study to 1973 (85), the averages 
increased to .22 and .39 respectively. In the latter report, Ghiselli calls our atten- 
tion to “maximal validity coefficients” (.35 for proficiency criteria and .45 for 
training criteria)—the average of the “highest average validity coefficient” for each 
job. The report, however, does not indicate the number of studies involved nor the 
distribution of studies by validity attained. Nor can it take into account the 
unreported studies, although negative results are more likely to go unreported 
than positive results. The current critique of personnel testing is based not only on 
the proposition that tests may discriminate on a racial or ethnic basis; the majority 
of studies reported show that they do not predict job performance for any group 
(29, 109, 124, 155). 


General Treatments 


New, comprehensive contributions to the literature were limited: a text by Tyler 
(162); a newly edited set of readings partly devoted to personnel selection by 
Fleishman & Bass (77); a short but thoughtful summary by Bass (24); a variety of 
semitechnical “how-to-do-it” publications such as those of Ash (13) and Taylor 
(158); and a useful classified bibliography by Tenopyr (159), supplemented by a 
spate of articles in the personnel journals (e.g. 157). Boyd & Shimberg (30) 
produced a helpful handbook on performance testing; Campion (42) describes the 
use of work sampling for personnel selection. Performance testing has been largely 
neglected in favor of the attractive economy of group-administered objective-type 
paper-and-pencil tests. The emphasis on “job-relatedness” in court decisions and 
federal test regulations, however, is likely to result in increased interest in the 
performance or work sample. Thomas (160) argues, however, that while the 
work-sample technique may have higher construct validity than paper-and-pencil 
psychological tests for manually oriented occupations, it has much less validity for 
those occupations requiring abstract reasoning. 


Testing in the Seventies 


Testing is a massive enterprise and a big business. Buros’ (35) Seventh Yearbook 
attests to the first: two volumes listing 1157 tests and a variety of indexes based 
thereon. Holmen & Docter (96) adumbrate the second—six giants in the field 
collectively account for approximately three-quarters of the total U.S. test sales. It 
is not surprising to learn that the practices of test publishers fall somewhat short of 
professional standards. 

The question of whether or not tests invade privacy continues to be of interest. 
In regard to personnel selection, Rosenbaum (145) found, however, that the 
perception of the invasion tends to be idiosyncratic, a function of personal char- 
acteristics. What is private to you is not necessarily private to me, and vice versa. 


APA Standards 


The 1966 Standards (5) were directed primarily toward test publishers; the em- 
phasis was on the content of the test manual. The 1974 revision (6), a collaborative 
effort of the APA, the American Educational Research Association, and the 
National Council on Measurement in Education, addresses itself also to the 
qualifications of the test user and to the “equal employment opportunity” issues 
that have emerged in regard to testing since 1966. 


Job Analysis 


One effect of the U.S. Supreme Court decision in Griggs vs Duke Power (63) was 
to reaffirm legally both doctrine and received tradition in personnel selection: 
selection predictors should be job related. To attain that result, job analysis is an 
essential step [writ large in the proposed new federal guidelines (165)]. Brute force 
empiricism—a battery of tests selected willy-nilly and tried out against any avail- 
able criteria—will probably not satisfy the “job relatedness” requirement unless a 
showing can be made of both satisfactory validity coefficients and a logical or 
rational relationship between test content and job tasks. 

Job analysis methodology has become a major area for exploration and theor- 
izing. Prien & Ronan (131) provide an extensive review of findings on methods of 
conducting job analysis and utilization of job analysis results. They identify six 
“basic considerations” for which further research is needed: the reliability and 
validity of responses obtained from informants; the relations between job function 
components and person characteristics; the relations between job functions and 
performance style; the relations between function characteristics (e.g. time re- 
quirements, level of difficulty) and person characteristics; the construct validation 
of job functions and function composites; and the appropriate experimental 
designs and statistical paradigms for analysis of job data. 

With respect to specific developed approaches, Fine & Wiley (76) published an 
introduction to Fine’s functional job analysis, Fleishman & Stephenson (78) 
reviewed progress on Fleishman’s motor-skill-oriented taxonomy of human performance, 
and McCormick and his associates described applications of the Position 
Analysis Questionnaire to the study of jobs (114, 115) and to “indirect” validity 
(i.e. a version of synthetic validity) (113). 

Public jurisdictions, because of the exigencies of merit system examining, are 
more likely than private employers to emphasize job relatedness in tests, and to 
attempt to justify them on the basis of content validity rather than criterion-related 
validity. Reflecting this need, several government agencies, state and federal, have 
published manuals on job analysis specifically designed for personnel selection 
(139, 164, 175). 


Content Validity 


Although content validity has been one of the three main approaches to test 
validation included in the APA Standards, and has been set forth as a permissible 
approach to validation in both OFCC and EEOC regulations, literature on a 
relevant methodology for demonstrating its existence is practically nonexistent. 
Lennon (108) explored in 1956 some of the assumptions underlying the technique, 
but left procedures to others. The 1972 amendments to CRA 1964, however, bringing 
merit systems under EEOC scrutiny, have generated new interest in this approach 
to validation, especially since a significant fraction of Civil Service tests are 
designed to measure subject-matter knowledge. Mussio & Smith (121) put to- 
gether what is probably the first procedural manual on how to conduct and 
document a content validity study. 


Synthetic Validity 


Developed to provide estimates of test validity in the absence of data on an 
adequate sample of examinees for whom both test scores and criterion scores are 
available, the various approaches to synthetic validity involve matching a test-element 
profile with a job-element profile. Gordon (89) presents a rousing challenge 
to the more usual criterion validity approach, and opts for synthetic validity 
analysis as preferable. Primoff has made the most significant contribution 
to the technique in his development of the J-coefficient (132, 133). A recent 
manual (136) outlines step-by-step application of the technique. One important 
limitation must be recognized: a J-coefficient analysis provides no data on possible 
differential validity on a racial or ethnic basis, since person-data are not involved. 
Cleff’s (50) man-job matching technique is an interesting alternative to test-based 
synthetic validity approaches. Instead of correlating job attribute profiles with test 
attribute profiles, the Cleff technique directly correlates job-task profiles with 
self-rating task profiles obtained from applicants, and dispenses with tests alto- 
gether. 
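
The profile-matching idea can be illustrated with a short sketch. It is not Cleff's (50) actual procedure, whose task dimensions and scoring are not given here; the four-dimension profiles, the rating scale, and the function name `pearson` are assumptions made purely for illustration. The sketch correlates a job-task profile with each applicant's self-rated task profile and treats the higher correlation as the closer match.

```python
import math

def pearson(x, y):
    """Pearson correlation between two equal-length numeric profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a profile with no variation cannot be correlated")
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: importance of four task dimensions to the job, and
# two applicants' self-ratings of preference/experience on the same four.
job_profile = [5, 3, 1, 4]
applicant_a = [4, 3, 2, 5]
applicant_b = [1, 4, 5, 2]

match_a = pearson(job_profile, applicant_a)  # about 0.83
match_b = pearson(job_profile, applicant_b)  # about -0.96
```

Under these made-up numbers applicant A's profile resembles the job profile and applicant B's is nearly its mirror image; no test scores enter the computation at any point.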


Job Element Examining 


The typical objective-type multiple-choice test places a premium on reading skills 
and, in blue-collar jobs, frequently screens out minority group applicants. The 
job element examination procedure developed by Primoff at the U.S. Civil Service 
Commission is one of the most creative innovations in selection technology to 
emerge in the past decade, but its spread has been limited due to the almost total 
absence of publication in professional journals. Materials are available, however, 
from the U.S. Civil Service Commission (134, 135, 137). 
The three basic ideas behind the job element method are: 1. Work for a position 
may be described by a small set of rather broad job elements such as “ability to 
inspect,” “ability to lead or supervise,” “reliability and dependability,” etc. 2. It is 
possible to discover systematically the level of ability attained or potential of each 
applicant by drawing upon several sources of information: (a) the application 
form; (b) a supplementary information questionnaire; (c) reference inquiry 
replies; (d) scores on aptitude, ability, and other kinds of tests. 3. Information 
about each applicant can be evaluated (rated) against “benchmark” behaviors for 
a position and classified at one of five levels of adequacy for each element: 
4 = demonstrated superior ability; 3 = demonstrated satisfactory ability; 
2 = barely acceptable or potential ability; 1 = ability of some value, but less than 
barely acceptable; 0 = training, experience, or test performance of no value. 
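
The five-level adequacy scale lends itself to a simple record structure. The sketch below is only an illustration of that scale: the class and function names are invented here, the applicant ratings are made up, and the rule of flagging elements rated below level 2 is an assumption read off the scale labels rather than a scoring rule taken from the Civil Service Commission materials (134, 135, 137).

```python
from enum import IntEnum

class Adequacy(IntEnum):
    """The five adequacy levels described in the text (names are illustrative)."""
    NO_VALUE = 0
    SOME_VALUE = 1
    BARELY_ACCEPTABLE = 2
    SATISFACTORY = 3
    SUPERIOR = 4

def elements_below_acceptable(ratings):
    """Return, sorted, the job elements rated below 'barely acceptable'.

    Adequacy(level) raises ValueError for any rating outside 0..4,
    so malformed ratings are rejected rather than silently scored.
    """
    return sorted(
        element
        for element, level in ratings.items()
        if Adequacy(level) < Adequacy.BARELY_ACCEPTABLE
    )

# Hypothetical ratings of one applicant against benchmark behaviors.
applicant = {
    "ability to inspect": 3,
    "ability to lead or supervise": 1,
    "reliability and dependability": 4,
}
shortfalls = elements_below_acceptable(applicant)
```

How element ratings are weighted or combined into a final score is deliberately left out, since the text does not describe it.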

Ash et al (16) describe an application to a state civil service system. The 
technique has also been introduced into selection procedures in industry, for 
mechanics, servicemen, and a variety of laborer occupations. 


Criteria 


Test validation, in the usual case, is based upon criterion validity. The criterion 
remains the weak link in the chain. Smith et al (155) have produced a good 
journeyman analysis of the characteristics of criteria to be attended to and a 
detailed rundown of almost all types of criteria used to evaluate job performance. 
An appendix summarizes an analysis of 118 validation studies conducted in 37 
governmental agencies, showing that most current practice falls far below its potential: 
in 85 studies only judgmental criteria, i.e. supervisory ratings, were 
used. In almost two-thirds of these studies the test-criterion correlations were not 
statistically significant. Cravens & Woodruff (59) present an interesting method- 
ology for developing criteria of salesman performance based upon a model of 
sales territory performance. 

In a study somewhat reminiscent of Ghiselli’s (83, 86), Lent et al (109) computed 
Significance Batting Averages (SBAs) for various combinations of predictor and 
criterion categories for a sample of 406 studies that yielded a total of 1506 joint 
predictor-criterion uses. Combining all studies, the overall SBA was .43: 43 percent 
of the reported relationships were significant. Maximum Performance predictors 
(e.g. aptitude tests) were more likely to yield significant results than Typical 
Performance predictors (e.g. personality or interest tests) (55 percent vs 21 percent). 
Predictive validity studies yielded more frequent significant results than 
concurrent designs. The analysis has interesting possibilities for picking predic- 
tor-criterion categories most likely to yield significant validity results, and possibly 
for identifying moderator variables for pairings where correlation results are 
usually not significant. 
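
As described, the Significance Batting Average is simply the proportion of reported predictor-criterion relationships that reached significance, computed within each category pairing. A minimal sketch follows; the record layout, the category labels, and the sample data are hypothetical and are not taken from Lent et al (109).

```python
from collections import defaultdict

def significance_batting_average(records):
    """Map (predictor category, criterion category) -> significant / total."""
    counts = defaultdict(lambda: [0, 0])  # key -> [significant, total]
    for predictor, criterion, significant in records:
        tally = counts[(predictor, criterion)]
        if significant:
            tally[0] += 1
        tally[1] += 1
    return {key: sig / total for key, (sig, total) in counts.items()}

# Hypothetical records: (predictor category, criterion category, significant?)
sample = [
    ("maximum", "ratings", True),
    ("maximum", "ratings", True),
    ("maximum", "ratings", False),
    ("typical", "ratings", False),
    ("typical", "ratings", True),
    ("typical", "ratings", False),
    ("typical", "ratings", False),
]
sba = significance_batting_average(sample)
```

With these made-up records the Maximum Performance pairing bats two for three and the Typical Performance pairing one for four, which is the kind of contrast the tabulation is meant to expose.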

As the foregoing “state of the art” summary is intended to suggest, the art has 
largely remained status quo ante. 


TESTING AND CIVIL RIGHTS 


Of the approximately 70,000 complaints filed annually with the Equal Employ- 
ment Opportunity Commission, 15 to 20 percent involve alleged discrimination by 
unfair testing (72). The spate of court cases involving “expert” psychological 
testimony has led to the witticism that Title VII of CRA 1964 is “the psychologists’ 
social security act.” The overriding issue is the extent to which psychological tests 
and other selection criteria which have documentable adverse impact against 
minority group applicants, particularly Blacks and Spanish-surname individuals, 
discriminate against them unfairly—i.e. underpredict their performance on the job 
or simply fail to provide a useful prediction of job performance. Significant 
differences of opinion have emerged with respect to two subsidiary issues: first, 
what are the criteria by which discrimination may be said to exist 
(the consensus is that the fact of adverse impact itself and the finding of significant 
differences in mean predictor scores are not sufficient evidence of discriminatory 
impact), and second, do the data in fact support as a general proposition the 
existence of differential validity? 

While industrial psychologists have largely rejected the point of view that the 
inferior performance of Blacks, particularly on cognitive tests, is to be expected 
because of hereditary racial differences, they seem equally unwilling to embrace 
the notion, advanced by Richardson & Spears (141), that the whole fault lies with 
the “psychometric approach”—that Blacks are different, not deficient, that psy- 
chometrics has failed to take account of cognitive style, of the social context of 
testing, of the diversity of human abilities, of the importance of nutrition, of the 
differences of score distributions and trait distributions. Rather, admitting that 
tests and other selection criteria measure the results of deprivation and discrim- 
ination, the emphasis is placed upon the prediction of job performance. This is 
the principal thrust of federal and state regulation in the area of testing: if a 
plaintiff, on whom the initial burden rests, can demonstrate adverse impact, the 
burden shifts to the respondent to demonstrate that the selection procedure in- 
volved is a valid predictor of job performance, and that an alternative procedure 
entailing less adverse impact is not available. 

Byham & Spitzer (36, 37) bring together some of the literature from Myart vs 
Motorola to Douglas vs Hampton in a volume that attempts to spell out the 
implications for personnel management of the EEOC Guidelines on Employment 
Selection Procedures and the Office of Federal Contract Compliance’s Order on 
Testing, although little reference is made to the litigation on tests in federal or state 
courts. Their main focus is selection practices and procedures as they relate to 
equal employment opportunity. The discussions of job-relatedness (validity) and 
of differential validity should be helpful to personnel managers whose knowledge 
of the technical aspects of testing is limited. 

In a follow-up of an earlier analysis by Cooper & Sobol (56), the Harvard Law 
Review summarized the implications of Title VII of CRA 1964 with an analysis of 
more recent cases (72). Peskin (128) offers guidelines for the development of a 
viable equal employment opportunity program. Both law and regulation on the 
one hand, and social and economic pressure on the other, increasingly have been 
shaping personnel selection practices in advance of, and sometimes in spite of, the 
specific results of psychometric research on test technology. Even if it could be 
unequivocally demonstrated that adverse impact was the necessary result of tests 
of high validity and accurate prediction of job performance—such a demonstra- 
tion does not yet seem to be within reach of the psychometric art—the imperatives 
of redress for the past and “affirmative action” in the present leave us with a sterile 
finding only minimally applicable to the circumstances of employment today. 


Testing Guidelines 


The 1966 EEOC Guidelines on Testing (92), calling for validation in the face of 
adverse impact and strongly endorsed by the U.S. Supreme Court in Griggs vs 
Duke Power (63), were followed in 1968 by an order from the Office of Federal 
Contract Compliance (124) spelling out in greater detail the requirements for test 
validation and incorporating the standards of test validation set forth in the 
APA Standards (5). EEOC amplified its Guidelines in 1970 (92) to make them 
essentially similar to the OFCC Order. Although OFCC could cancel government 
contracts for noncompliance, under the original law EEOC had only limited 
power to intervene and practically none to initiate cases or impose sanctions. The 
1972 amendments to CRA 1964 (48), however, have given EEOC substantially 
greater authority: to initiate cases and sue in its own behalf with its own corps of 
attorneys, and to extend its regulations to the public sector, with the exception only 
of the federal government itself. The 1972 amendments also established an Equal 
Employment Opportunity Coordinating Council representing the five federal 
agencies involved (EEOC, OFCC, U.S. Civil Service Commission, Civil Rights 
Division of the Department of Justice, U.S. Commission on Civil Rights). 
Among other duties, the council was charged with developing Uniform Guidelines 
on Employee Selection Procedures to be applicable to the federal agencies involved 
in test litigation. The new Guidelines (165) probably will be issued in final form in 
1974. 

In addition to EEOC and OFCC, other state and federal agencies have issued 
regulations governing selection procedures. Probably the most extensive effort at 
the state level has been that of the Technical Advisory Committee on Testing in 
California (38). The United States Civil Service Commission (163) has published 
standards for governmental jurisdictions. 

The history of these guidelines is a good example of the disparity between 
theory and application. The APA Standards (5, 6) set forth a professional con- 
sensus with respect to how tests should be developed and applied, but their 
translation into requirements in the draft of Uniform Guidelines has evoked 
widespread response and extensive debate both within and outside the profession 
(e.g. 118) as to whether the regulations are realistic in content and achievable in 
practice. OFCC, however, has published an amendment to its order incorporating 
that section of the Uniform Guidelines which deals in very specific detail with 
requirements for the presentation of evidence of test validity (125). 

Biddle (26) provides a summary of the laws involved, the type of job, the validity 
evidence offered, and the awards in 50 leading public and private sector cases 
brought under various laws against discrimination, and the National Employment 
Law Project has published (64) a similar summary of 28 cases involving discrim- 
ination in the public sector. As regulations proliferate and become more specific, 
the only sure prediction to be made is that litigation over selection procedures will 
increase. 


Differential Validity or Ethnic Differences in the 
Prediction of Job Performance 


The statistical questions centering on ethnic group membership as a moderator 
variable in the prediction of job performance from test results are treated in a 
subsequent section of this review. Here we examine the evidence for or against 
differential validity. 

The first problem involves an analysis of the criteria under which it may be said 
that differential validity exists at all. Boehm (29) identifies differential validity as 
the case in which there is a significant difference between the validity coefficients 
of the two ethnic groups compared and one or both coefficients are significantly 
different from zero, and excludes single-group validity where the obtained corre- 
lations are not significantly different. Schmidt et al (148), reanalyzing the Boehm 
hypothesis, conclude that single-group validity is “... probably illusory in na- 
ture”, a function of small sample size and other characteristics of the data for the 
minority group. Schmidt & Hunter (150) reexamine the Cleary (49) and Thorn- 
dike (161) definitions of test bias—a test is biased if criterion prediction scores are 
too low, or if the test fails to select the same proportion of minority applicants that 
would be selected on the criterion itself or on a perfectly valid test—and conclude 
that the former definition is the more defensible. Humphreys (99), using a differ- 
ent rationale, recommends a direct comparison of correlations in the minority and 
majority group samples, and that “only in the event that the minority correlation is 
significantly lower and the confidence limits around that correlation include no 
useful levels of the relationship should the correlation be considered essentially 
zero.” Cole (52) presents a model of equal opportunity for potentially successful 
applicants. Einhorn & Bass (69) review the general methodological considerations 
involved. 
The data on differential validity, however, fit uncomfortably with all of the
theories. 
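
The distinctions drawn above rest on routine significance tests for correlation coefficients. The following is a minimal sketch, assuming independent samples and the Fisher r-to-z approximation; the function names, the 0.05 level, and the labels returned are illustrative assumptions rather than anything prescribed by Boehm (29) or the other authors cited.

```python
import math
from statistics import NormalDist


def correlation_z_test(r, n):
    """Two-sided test that one validity coefficient differs from zero."""
    z = math.atanh(r) * math.sqrt(n - 3)
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


def difference_z_test(r1, n1, r2, n2):
    """Two-sided test that two independent correlations differ."""
    se = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (math.atanh(r1) - math.atanh(r2)) / se
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


def classify(r1, n1, r2, n2, alpha=0.05):
    """Label a pair of validity coefficients (labels are hypothetical)."""
    sig1 = correlation_z_test(r1, n1)[1] < alpha
    sig2 = correlation_z_test(r2, n2)[1] < alpha
    differ = difference_z_test(r1, n1, r2, n2)[1] < alpha
    if differ and (sig1 or sig2):
        return "differential validity"
    if sig1 != sig2:
        # One coefficient is significant, the other is not,
        # yet the two do not differ significantly from each other.
        return "single-group validity"
    if sig1 and sig2:
        return "valid in both groups, no significant difference"
    return "no validity in either group"
```

With hypothetical figures, `classify(0.30, 100, 0.20, 30)` returns "single-group validity" even though the two coefficients are close: the smaller sample simply lacks the power to reach significance, which is the kind of artifact Schmidt et al (148) describe.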

Significant differences in validity correlations are found; single validity vs zero 
validity coefficients are found; and perhaps most frequently (126), zero validity for 
both groups is found. Bass & Turner (25) report ethnic group differences in 
relationships among criteria of job performance, suggesting that lower predictor 
scores for minority groups reflected accurate performance predictions. Campbell 
(40) and Crooks (60), on the basis of an elaborate study of three jobs in the federal 
service, conclude that differential validity is an artifact of poor criterion design. 
Einhorn & Bass (69) address their attention to methodological considerations. 
Gael & Grant (82) and Grant & Bray (90) report that for telephone company jobs, 
although mean predictor and criterion scores differed, the regression slopes were 
similar, indicating that minority group examinees fared less well on both
predictors and criteria.

The ETS study (60) is probably the most extensive of any to date, but the use of
job knowledge tests as criteria probably inflated test-criterion relationships by way 
of commonality of such factors as reading facility, and the authors’ extrapolation 
to test selection in general is extremely optimistic. Federal guidelines call for 
separate validation for each ethnic group where feasible (e.g. enough cases are 
available). In the present state of the art, given the limited amount of consistent 
hard data available, that would seem to be the most prudent course. Even if it may
be generally true that differential validity or even single-group validity is the 
exception rather than the rule, neither currently observed test validities nor 
ethnic-comparison studies provide much assurance that, in particular situations, 
something like differential validity does not exist. 


Merit Systems 


When vacancies in a specific job occur in the typical state, municipal, county, or 
other governmental unit merit system, an “examination announcement” is issued, 
candidates are screened for admission to examination on the basis of credentials 
(such as completion of high school), assembled and tested, and the names of those 
who “pass” are entered on a register. To fill a particular opening, the top three, top 
five, or some larger group defined by a referral rule, are referred to the cognizant 
supervisor, who chooses from among the referees. 

This procedure has tended to thwart the entry of minority group members into 
civil service employment at two points. First, much more frequently than non- 
minority group members, they fail to meet the entry qualifications requirements, 
particularly the high school graduation requirement. Second, they fail the written 
tests at a much higher rate. 

The most extensive challenge to these results has been mounted against police 
selection procedures (4, 32, 45, 46, 64, 71). Sometimes independently, sometimes 
jointly, selection of firemen has also been challenged (45, 51, 81). 

As in the Griggs case, where the high school education requirement is imposed, 
no evidence is adduced as to its validity. The tests used for police selection range 
from homemade lists of questions to such old warhorses as Army Alpha, the 
WWII Army General Classification Test, the Wonderlic, and a series of tests 
published by the International Personnel Management Association (formerly the
Public Personnel Association). Evidence of test validity has been almost univer- 
sally absent in these cases, and in most of them the courts have struck down both 
the tests and the education requirement. Under the 1972 amendments to CRA 64, 
merit systems must abide by the guidelines set forth in the Griggs case. The 
implementation of this requirement, given the differences in ground rules as 
between merit system hiring and private industry hiring, constitutes a major 
challenge to test methodology. 

Among all the cases challenging merit system selection procedures, probably 
the most interesting and far-reaching is Douglas vs Hampton (66), which involves 
a classical application of the generic intellectual aptitude concept to selection for 
employment. From the early 1930s until 1955, the U.S. Civil Service Commission 
administered a series of content-oriented tests, collectively known as the Junior 
Professional Assistant examinations, to qualify college graduates for entrance into 
specialized technical and professional positions. In 1955, the Commission 
introduced the Federal Service Entrance Examination, a high-level two-part test 
of verbal facility and numerical reasoning, as a generic measure for entry into 
more than 200 widely varying professional, managerial, and technical positions in 
federal agencies. Its generic character was also attested to by the fact that college 
students who took the Graduate Record Examination could offer their scores on 
that exam for the FSEE, until the Educational Testing Service refused to release 
the GRE scores on the basis that GRE was not designed or validated for occupa- 
tional selection and placement. The plaintiffs in Douglas vs Hampton (66), in a 
class-action suit, made a plausible case of adverse impact against Blacks, and the 
defending U.S. Civil Service Commission has thus far largely based its case upon a 
technique it describes as “job analytic” validity, involving a showing that the
measured verbal facility and numerical facility are required in jobs as diverse as 
park ranger at Yellowstone Park, baggage and customs examiner at O'Hare 
Airport, and human relations counselor in a big city. As this review goes to press, 
the U.S. Civil Service Commission (163a) has announced that it will replace the 
FSEE in the Fall of 1974 with a new Professional and Administrative Career
Examination (PACE). The replacement exam “... is expected to measure more
accurately an applicant’s ability to perform in a given job, which in turn will result 
in better placement. 


“All candidates will compete in a new written test which measures five abilities 
needed for successful performance. Each of the five parts will be weighted in 
relationship to the work to be done.” The use by merit systems of unvalidated tests 
of general intellectual ability may well be drawing to an end. 


The Private Sector 


While particular attention has been drawn to litigation involving tests and selec- 
tion procedures in the public sector, the majority of cases over the past 3 years have 
been in the private sector. Since the Griggs case, few respondent companies have 
attempted to justify the use of tests without evidence of validity. Involving a set of 
facts almost identical to those in Griggs, the West Georgia Power Company 
defended its use of tests on the basis of statistical techniques that were challenged
as improper (166). The District Court decision in favor of the company was so 
confusing, however, that fearing the setting of unrealistic precedents, the Division 
of Industrial and Organizational Psychology of the American Psychological As- 
sociation filed an amicus brief on the subject of test validation (33). 

Another landmark case, notable because of the out-of-court large back-pay 
relief extended to the classes of individuals allegedly discriminated against, was 
the proceeding before the Federal Communications Commission by the EEOC 
against the American Telephone and Telegraph Company (153). The suit alleged 
both discrimination on race and on sex: that women were segregated largely into 
low-level telephone operator occupations, and that Blacks were excluded from 
higher-level maintenance jobs. At issue in the latter situation was a test battery 
which allegedly screened out a higher proportion of Blacks than of Whites. While 
the hearing delved in detail into the validity of the test battery, the settlement 
addressed itself to the matter of retributive pay and a revision of employment 
practices and goals to increase the opportunities of females and minority groups in 
the company, without respect to testing. 

A third important case, Arrington vs Massachusetts Bay Transit Authority (9), 
attacked the validity of the U.S.E.S. General Aptitude Test Battery for the selec- 
tion of toll collectors and bus drivers. It led to a major national research program 
funded by the Urban Mass Transportation Administration to develop a nondis- 
criminatory bus driver selection battery (20). 

In these and many other cases, psychologists played adversary roles as expert 
witnesses, undoubtedly leaving overall the impression that the validation of se- 
lection procedures is an art with respect to which no strong professional consensus 
yet exists. 


Females 


The inclusion of sex among the categories on the basis of which discrimination was 
to be prohibited was introduced into CRA 64 by senators who believed that it 
reduced the bill to an absurdity and would lead to its defeat. History has recorded 
otherwise: the Senate early and women later took it seriously. Title VII has 
provided a major avenue of redress for alleged sex discrimination. Although 
enforcement efforts have been less than vigorous (130), fully one-third of the cases 
filed before EEOC alleging discrimination have been brought on the basis of the 
sex provision (146), and of 500 EEOC bias complaints active in 1973 involving 
institutions of higher education, 45 percent involved discrimination on the basis of 
sex. Astin et al (19) document in extenso discrimination against women in almost 
all aspects of American society, including employment selection and promotion. 
Fidell (75), in a survey inviting psychology department chairmen to evaluate 
potential candidates for faculty appointment, demonstrated that male and female 
applicants with identical credentials, offering vitas differing only in the sex-linked 
name of the applicant (e.g. Jeanne Smith-John Smith) would be treated differ- 
ently: on the average, women were less likely to be considered for employment, 
and if considered would be offered lower-status positions at lower salaries than 
men with the same credentials. Rosen & Jerdee (144), in a study of similar design,
arrived at the same findings with respect to entry management positions in in- 
dustry, although to the extent to which the studies are comparable, the male- 
female differences in industry seemed to be smaller than those in academia. 

Discrimination against women has been based primarily upon sex stereotypes, 
not imputed ability or test performance differences, but Arvey & Mussio (10) did 
find that Civil Service tests showed unfair discrimination against culturally dis- 
advantaged females. Probably the most significant sex-discrimination case reach- 
ing settlement was the one involving AT&T (153), described above. 


Professionals 


Access to practice in professional occupations is a two-step process, involving the 
successful completion of a specified course of study in an institution of higher 
education leading to an appropriate professional degree, and successful passage of 
an examination by a state certifying agency. In the major professions—medicine, 
law, teaching, dentistry, pharmacy, etc—no significant challenge has yet been 
mounted against the educational requirement. The state-administered entry 
examinations, however, are increasingly becoming subject to challenge under 
Title VII and related laws. 


The Bar 


In respect to admission to the bar, challenges are now being mounted in about 19 
states. Typical proceedings are those in Alabama (3), California (73), Virginia 
(106), and Georgia (127). With minor variations, these cases all raise three or four
test selection issues: (a) the tests result in adverse impact—typically, 80-90 percent 
of white examinees pass as against 10-60 percent of Black or Spanish-surname 
examinees; (b) the system suffers from procedural discrimination—the examiner 
knows the race of the examinee and grades accordingly (this issue is raised in some 
cases only); (c) the tests are unreliable—typically, each question on the exam is 
written by one examiner who then grades all responses to that question; (d) the 
tests have no demonstrated validity against criteria of actual law practice. 

In relation to the criterion problem, it is alleged that the practice of Black 
attorneys, centered upon disadvantaged clients, is different from that of White 
attorneys. The examinations, it is alleged, reflect the kind of practice (corporate 
law, civil suits) more typical of white attorney practice. 

To date the courts of original jurisdiction have all turned back challenges to the 
state bar examination, but appeals courts have yet to be heard from. Psychologists 
have served as expert witnesses on both sides in some of these cases, and the record 
includes psychological analysis of the reliability and validity issues. 


The Education Profession 


Admission to the teaching profession is, like admission to the practice of law, 
conditioned upon completion of a prescribed course of education and a subse- 
quent certification by a state agency. Particularly in states in which a dual white- 
black system formerly existed, a second requirement has been added typically by 
the state certification agency, such as (a) a master’s degree, (b) a specified number
of graduate credit hours, or (c) special examinations. 

The special examinations route, however, has led to additional challenges on 
the basis that the examinations themselves are discriminatory on race and not job 
related (8, 21, 47). In Armstead vs Starkville Municipal Separate School District (8) 
the test at issue was the Graduate Record Examination. Representatives of the 
publisher, among others, testified that it was not designed or appropriate for 
teacher certification. In Baker vs Columbus Municipal Separate School District 
(21), in the process of converting from a dual segregated system to a unitary 
system, the district adopted a cutoff score of 1000 on the National Teachers 
Examination as a condition of hire or rehire. All the teachers denied reemploy- 
ment on the basis of failing the test had been recommended for rehire by their 
supervisors on the basis of performance. Adverse impact was evident: 90 percent
of white examinees passed as against only 11 percent of black examinees. Expert 
witnesses for both sides testified that the validity of the test as a predictor of 
effective teaching was unknown. The court struck down use of the test. 
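
The size of the disparity in pass rates reported in the Baker case can be expressed as a simple ratio, and its statistical reliability checked with a two-proportion test. The sketch below is illustrative only: the group sizes of 100 are hypothetical (the review reports percentages, not counts), and the function names are assumptions.

```python
import math
from statistics import NormalDist


def impact_ratio(minority_rate, majority_rate):
    """Ratio of the minority pass rate to the majority pass rate."""
    if majority_rate <= 0:
        raise ValueError("majority pass rate must be positive")
    return minority_rate / majority_rate


def two_proportion_z(pass_a, n_a, pass_b, n_b):
    """Pooled two-proportion z test; returns (z, two-sided p)."""
    p_a, p_b = pass_a / n_a, pass_b / n_b
    pooled = (pass_a + pass_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


# Pass rates reported in the text: 90 percent vs 11 percent.
ratio = impact_ratio(0.11, 0.90)

# Hypothetical counts (100 examinees per group), for illustration only.
z, p = two_proportion_z(90, 100, 11, 100)
```

On the reported percentages the minority pass rate is roughly one-eighth of the majority rate; whether a given ratio constitutes adverse impact in law is a matter for the applicable guidelines, not for this calculation.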

Little attention has yet been paid to the question of the validity of profes- 
sional certification exams. As in the bar cases, it is frequently argued that the 
criteria of successful professional practice are so diverse and elusive as to defy 
practical measurement; the entrance examinations must be accepted on the basis 
of the expertise of the peer professionals who devise them. In the face of adverse 
impact against protected minority groups, however, it is clear that challenges to 
professional entrance and certifying examinations will increase. 


Reading Ease 


Reading difficulty level is an often-neglected dimension of test construction, but it 
may be very important in determining an individual’s ability to perform on an
examination. In a little-noticed paper, Campbell (39) demonstrated that reducing
the reading level difficulty of tests for police increased test-performance criterion
correlations. The issue of test fairness has renewed interest in the problem, 
particularly in merit system jurisdictions where a stipulated level of educational 
attainment—from none to high school graduation—is imposed to qualify for en- 
trance to an examination. If the exam requires a level of reading comprehension 
substantially higher than the job specification calls for, the possibility of unfair 
discrimination exists. Ash (15), using the SMOG index of readability (117) found 
that 36 of 76 examinations for positions with specified minimum educational 
requirements had reading level grades significantly higher than the educational 
requirement, and that of 58 tests for positions (frequently labor) with no minimum 
educational level requirement, 25 required a comprehension level beyond high 
school graduation. Ash suggests that the disparity between reading difficulty of the 
examinations and minimum educational qualifications for the job tends to make 
such tests racially discriminatory, given the disparity in average educational at- 
tainment between Blacks and other minority group members on the one hand and 
nonminority group members on the other. Hoffman (93) compared the Flesch, 
Forbes-Cottle (79), and Dale-Chall methods of estimating the reading grade level 
of tests used in the Pennsylvania State Civil Service, and concluded that
the Flesch formula gave more consistent results. Of 22 tests subjected to reading 
ease analysis, six had reading grade levels slightly higher or higher than the job’s 
entrance qualification. 
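
A comparison of the kind Ash (15) reports can be sketched with McLaughlin's published SMOG formula, which estimates a reading grade from the number of words of three or more syllables in a sample of sentences. The code below assumes the polysyllable count has already been obtained by hand or by some other tool; the function names and the comparison helper are hypothetical and do not reproduce the procedure of any study cited here.

```python
import math


def smog_grade(polysyllable_count, sentence_count):
    """SMOG reading grade from a count of words with 3+ syllables.

    The count is normalized to a 30-sentence sample, as in the
    published formula.
    """
    if sentence_count <= 0:
        raise ValueError("sentence_count must be positive")
    normalized = polysyllable_count * 30 / sentence_count
    return 1.0430 * math.sqrt(normalized) + 3.1291


def exceeds_requirement(reading_grade, required_grade, tolerance=0.0):
    """True if the exam reads above the job's stated education level.

    `tolerance` is an illustrative allowance, not a published standard.
    """
    return reading_grade > required_grade + tolerance
```

For example, an examination sample with 90 polysyllabic words in 30 sentences yields a grade of about 13, which would exceed a high school graduation requirement of grade 12.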


METHODOLOGICAL DEVELOPMENTS 


Topics involving methodological considerations are prominent in the literature, 
including moderator variable research, sequential testing in decision making, and 
systems approaches to personnel selection. Considerable interest has also been 
displayed in the application of multivariate statistical technology to problems 
involving selection methodology. 


Multivariate Statistical Techniques 


Zedeck (176) has proposed the use of a multivariate discriminant analysis proce- 
dure to identify moderator variables and has compared the results derived from 
this procedure with the results obtained by the use of both the univariate simple 
algebraic and absolute difference techniques. The discriminant function has also 
been employed by Lissitz & Henschke-Mason (111) in an attempt to improve the 
hit rate of a multiple discriminant classification procedure.

Other multivariate research has been directed toward the solution of methodo- 
logical problems involving linear composite scores utilized in selection and clas- 
sification. Conger & Lipshitz (55) have developed a general index of reliability, 
called canonical reliability, for use with profiles. In the event that empirical 
weights are desired and factor analysis is required, the problem of determining 
what components of the model are invariant across a set of populations must be 
faced. Bloxom (27) deals with the invariance problem by developing special cases 
of the factor analysis model for four selection situations. 

The question of reliability has also been addressed in other forums. Most 
questionnaires and surveys yield data with weak scale properties and much mea- 
surement error. Gleason & Staelin (87) have derived a procedure whereby ques- 
tionnaire data may be transformed to reduce the error variance and to improve the 
metric properties of the individual variables. Cureton et al (61) reexamine test 
length in relation to the standard error of measurement. 
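
The classical relations between test length, reliability, and the standard error of measurement can be stated compactly. The sketch below uses the Spearman-Brown formula and the usual definition of the standard error of measurement, assuming strictly parallel added items; it is a textbook illustration and is not the reanalysis offered by Cureton et al (61).

```python
import math


def spearman_brown(reliability, length_factor):
    """Projected reliability when a test is lengthened by `length_factor`."""
    k, r = length_factor, reliability
    return k * r / (1 + (k - 1) * r)


def standard_error_of_measurement(sd, reliability):
    """SEM in raw-score units for a test with the given SD and reliability."""
    return sd * math.sqrt(1 - reliability)


def lengthened_sem(sem, length_factor):
    """Raw-score SEM after lengthening with parallel items.

    Under the parallel-items assumption the raw-score SEM grows with
    the square root of the length factor, even though reliability rises.
    """
    return sem * math.sqrt(length_factor)
```

Doubling a test with reliability 0.50 raises the projected reliability to about 0.67, while the raw-score standard error grows by a factor of about 1.41; the two results are consistent because the raw-score variance grows faster still.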

A degree of methodological cross-pollination characterizes many emerging 
disciplines, and selection is no exception. Of particular interest in this regard is 
O’Connor’s (123) application of biostatistical techniques to personnel
classification and turnover.

The potential advantages of a sequential testing strategy for multidimensional
dichotomous classifications have been investigated by Linn, Rock & Cleary
(110). When comparing sequential testing strategies with conventional procedures 
with respect to efficiency in making dichotomous decisions, it was found that the 
latter require approximately twice as many items to achieve the level of accuracy 
obtainable by sequential tests. Rock, Barone & Boldt (143) have developed a 
mathematical solution to the two-stage selection problem which requires only 
predictor-criterion intercorrelations and selection ratio information. Theoretical
solutions suggest that a considerable amount of testing time may be saved with 
little or no decrease in the validity of the selection procedure for various values of 
the selection ratios. 
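
The economy of sequential testing can be illustrated with Wald's sequential probability ratio test, in which items are administered one at a time and testing stops as soon as a pass/fail decision is statistically warranted. This is a generic sketch and an assumption on our part: it is not necessarily the strategy examined by Linn, Rock & Cleary (110), and the item-success probabilities and error rates are hypothetical.

```python
import math


def sprt_classify(responses, p0=0.5, p1=0.8, alpha=0.05, beta=0.05):
    """Classify an examinee from a stream of right/wrong item responses.

    p0: assumed chance of a correct answer for an unqualified examinee.
    p1: assumed chance of a correct answer for a qualified examinee.
    Returns (decision, number_of_items_used).
    """
    upper = math.log((1 - beta) / alpha)   # cross this: decide "pass"
    lower = math.log(beta / (1 - alpha))   # cross this: decide "fail"
    log_ratio = 0.0
    used = 0
    for correct in responses:
        used += 1
        if correct:
            log_ratio += math.log(p1 / p0)
        else:
            log_ratio += math.log((1 - p1) / (1 - p0))
        if log_ratio >= upper:
            return "pass", used
        if log_ratio <= lower:
            return "fail", used
    return "undecided", used
```

Under these hypothetical settings an examinee who answers every item correctly is passed after seven items and one who misses every item is failed after four, so a fixed 20-item test would be needed only for examinees near the borderline.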


Mathematical Modeling 


There has been only slight activity in the general area of the mathematical 
modeling of decision strategies in personnel selection. To facilitate the develop- 
ment of a computer decision simulation model, Smith (156) has analyzed and 
programmed thought processes used by a psychologist dealing with personnel 
selection and placement problems. He discusses several implications concerning 
test validation and general decision making. Holdsworth (94) discusses the prob- 
lem of failing to meet the underlying assumptions when modeling personnel 
selection decisions. 


Moderator Variables 


Several issues in moderator variable research provided the focus for a lively and 
informative exchange of views. Abrahams & Alf (1) describe several methodolog- 
ical difficulties encountered by investigators seeking to improve prediction by 
identifying and using moderator variables. The types of problems described in- 
clude misuse of regression equations, residual scores, and validity coefficients, as 
well as failure to perform a complete evaluation of the moderator’s actual contri- 
bution to prediction. Dunnette (68) criticized the arguments advanced by 
Abrahams & Alf against both quadrant analysis and Ghiselli’s D technique.
In addition, Dunnette suggested several modifications of moderator search tech- 
niques. Abrahams & Alf’s (2) reply involved a discussion of the limitations of the 
quadrant analysis modification and an evaluation of Dunnette’s proposed two-stage
strategy for combining Ghiselli’s D technique with a directional strategy.
Ghiselli’s (84) moderator system for two predictors has also been criticized by 
Velicer (168) on the grounds of general inapplicability in practical prediction 
situations. Conger (53) concluded that the correlations between difference scores 
should be near zero for successful application of Ghiselli’s system. Velicer (168) 
has provided theoretical results as well as a practical example indicating that this 
correlation coefficient is usually much closer to unity. In view of this, the applied 
researcher is advised to examine alternative models for moderated regression. 
Zedeck (177) defines several alternative concepts of moderator variables in 
terms of whether they lead to differential validity or lead to differential predicta- 
bility, or involve moderated regression techniques. He discusses the dissimilarity 
of results yielded by the three approaches in terms of the types of variable 
employed as a moderator and in terms of certain methodological problems. He 
concludes that the utility of moderator variable research is limited by statistical 
problems, by the limited understanding of the statistical operation of moderators, 
and by lack of a rapid systematic approach to the identification of moderators. 
The position taken by Velicer (169) with respect to the conceptualization of 
moderator variables differs from that taken by Zedeck in that the former conceives 
the moderator variable as a case of heterogeneous regression. The heterogeneous
model yields several useful statistical tests of significance as well as some
indications concerning the type of variables likely to be good moderators.
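
The heterogeneous regression view of a moderator amounts to fitting a separate predictor-criterion regression line within each level of the moderator and asking whether the lines differ. The following is a minimal sketch of the fitting step only, using ordinary least squares; the data layout and function names are assumptions, and the significance tests that Velicer (169) derives are not reproduced.

```python
def ols_line(xs, ys):
    """Least-squares slope and intercept for one subgroup."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def regressions_by_group(records):
    """Fit one line per moderator level.

    records: iterable of (group_label, predictor_score, criterion_score).
    Returns {group_label: (slope, intercept)}.
    """
    grouped = {}
    for group, x, y in records:
        xs, ys = grouped.setdefault(group, ([], []))
        xs.append(x)
        ys.append(y)
    return {g: ols_line(xs, ys) for g, (xs, ys) in grouped.items()}


# Hypothetical scores for two subgroups with different slopes.
data = [("A", x, 2 * x + 1) for x in range(1, 5)]
data += [("B", x, 0.5 * x + 3) for x in range(1, 5)]
lines = regressions_by_group(data)
```

Markedly different slopes across subgroups, as in this contrived example, are what the heterogeneous model would flag as a moderator effect; similar slopes with different intercepts would point instead to a difference in level only.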

Bartlett & O’Leary (22) illustrate the use of a differential prediction model to 
moderate the effects of heterogeneous groups. In fact, they present 11 different 
models utilizing the interactive effect between racial groups and validity. It is not 
surprising that improvement of predictor utility results from separate validation 
on all possible groups as opposed to validation on one large heterogeneous group. 
Lefkowitz (107) reports on the empirical use of ethnic group as a moderator in 
predicting tenure. 

In an attempt to stimulate thought in previously unexplored directions, Ghiselli 
(86) has suggested that the most important contributions of the findings with 
respect to the existence of the moderator effect may lie in the implications for
measurement theory. Moderator research indicates that the measurable properties 
of human beings are interrelated in a dynamic fashion, requiring new dynamic 
conceptualizations about the qualitative description of human beings as distin- 
guished from the static concepts inherent in classical measurement theory. In 
addition, Ghiselli (86) contends that industrial psychologists may do well to focus 
their attention on the uniqueness of individuals as opposed to placing an emphasis
on the uniqueness of categories of individuals. 


Utility of Selection Devices 


The evaluation of benefit obtained from selection devices is a problem of consid- 
erable interest. Schmidt & Hoffman (149) empirically compared three methods 
of assessing the utility of a selection device. In an earlier study, McCornack (116) 
compared three predictor selection techniques in an attempt to determine an 
optimal technique for use in cross-validation. Gross (91) attacked the problem 
analytically in terms of the gain from selection, which is defined as the standard- 
ized average performance of a group of subjects selected in a future sample using 
a regression equation derived on an earlier sample. He derived expressions for the 
density function and the expected value of the gain from selection statistic and has
studied them in terms of sample size, number of predictors, and the prior distri- 
bution of the multiple correlation. 
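
For orientation, the familiar large-sample version of the gain from selection can be computed directly. The sketch below assumes bivariate normality, top-down selection, and a known population validity, so it ignores exactly the sampling problem that Gross (91) treats; the function name is hypothetical.

```python
from statistics import NormalDist


def expected_standardized_gain(validity, selection_ratio):
    """Mean standardized criterion score of those selected.

    Equals validity * ordinate(cutoff) / selection_ratio under
    bivariate normality, where the cutoff admits the top
    `selection_ratio` proportion of applicants.
    """
    if not 0 < selection_ratio < 1:
        raise ValueError("selection_ratio must lie strictly between 0 and 1")
    normal = NormalDist()
    cutoff = normal.inv_cdf(1 - selection_ratio)
    return validity * normal.pdf(cutoff) / selection_ratio
```

With a validity of 0.50 and half the applicants selected, the expected gain is about 0.40 standard deviations; tightening the selection ratio raises the gain, while a validity estimated on a small earlier sample would be expected to deliver less than this idealized figure.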

In an applied study, Inskeep (100) used relatively simple statistical techniques 
based on existing employee records in order to identify changes in the selection 
process leading to a reduction in labor turnover. 


Systems Approaches to Selection 


Systems approaches to selection and manpower problems received a greater share 
of attention than in the past. Sands (147) developed the CAPER model, which 
provides the personnel manager with the information necessary to minimize the 
estimated total cost of recruiting, selecting, inducting, and training a sufficient 
number of persons to meet a specified quota of satisfactory personnel. Systems of 
employee selection have been discussed in terms of the relationship between job 
requirements and employee objectives by both Sledr (154) and Winicky (174). 
Fleishman & Stephenson (78) developed a taxonomy to evaluate systems for
describing and classifying tasks that can improve generalization of research
results about human performance.

Astin (18), in discussing a manpower placement system for APA, speculates on 
the nature of the system’s basic features. Love (112) examines the problem from 
the individual’s perspective and suggests plans for updating current job-marketing 
procedures via a computerized job placement system. 


Miscellaneous Statistical Issues 


A number of studies related to traditional validation problems have appeared. 
Articles by Fossum (80) and Bryson (34) are representative of attempts to shorten 
tests and increase validity. Elshout & Roe (70) considered the general problem of 
generalizing correlational results from a sample, known to be drawn from a 
population whose range is restricted, to an unrestricted population. An interesting 
paper on the application of Jöreskog’s congeneric test model to the correction for 
attenuation problem is provided by Werts & Linn (172). 
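
The two classical corrections at issue here are easily stated. The sketch below gives the standard correction for attenuation and the textbook correction for direct restriction of range on the predictor; these are the conventional formulas that the cited papers take as their point of departure, not the extensions proposed by Elshout & Roe (70) or Werts & Linn (172).

```python
import math


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Estimated correlation between true scores.

    r_xx and r_yy are the reliabilities of predictor and criterion.
    """
    return r_xy / math.sqrt(r_xx * r_yy)


def correct_for_range_restriction(r_restricted, sd_unrestricted, sd_restricted):
    """Estimated validity in the unrestricted applicant population.

    Assumes direct selection on the predictor, linearity, and
    homoscedasticity.
    """
    u = sd_unrestricted / sd_restricted
    r = r_restricted
    return r * u / math.sqrt(1 - r * r + r * r * u * u)
```

As a worked case, an observed validity of 0.30 in a selected group whose predictor standard deviation is half that of the applicant pool corrects to about 0.53; the same observed value with reliabilities of 0.81 and 0.64 corrects for attenuation to about 0.42.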

The elusive suppressor variable also received a measure of attention. Conger & 
Jackson (54) note that some of the confusion surrounding it is due to the investi- 
gator’s failure to distinguish between its use in the prediction of a criterion, in the 
measurement of a construct, and in the delineation of relationships between 
constructs. They also explicate, definitionally and mathematically, the nature of 
the suppressor variable, and show the conditions under which there are formal 
identities between results based upon analyses using a suppressor variable and 
results based upon approaches involving multiple, part, and partial correlational 
procedures. 
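
The predictive use of a suppressor can be shown with the standard formula for the squared multiple correlation with two predictors. The correlations below are hypothetical and the example covers only the classical case, in which the second variable is uncorrelated with the criterion; it does not reproduce the formal identities developed by Conger & Jackson (54).

```python
def squared_multiple_r(r_y1, r_y2, r_12):
    """Squared multiple correlation of a criterion with two predictors."""
    if abs(r_12) >= 1:
        raise ValueError("predictors must not be perfectly correlated")
    numerator = r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12
    return numerator / (1 - r_12 ** 2)


# Hypothetical values: predictor 1 correlates 0.50 with the criterion,
# predictor 2 correlates zero with the criterion but 0.50 with predictor 1.
alone = 0.50 ** 2
with_suppressor = squared_multiple_r(0.50, 0.0, 0.50)
```

Here the proportion of criterion variance accounted for rises from 0.25 to about 0.33 although the added variable has no validity of its own, because it removes criterion-irrelevant variance from the first predictor.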

The development of a selection validation model incorporating both ipsative 
and normative approaches has been proposed by Howell (97). The model assumes 
that individuals are unique, and that despite this they can be grouped into typol- 
ogies, where a type has meaning only to the extent that it is useful for measure- 
ment purposes. 

Studies involving the use of convergent-discriminant validation techniques 
were somewhat under-represented in the articles surveyed. The multitrait multi- 
method approach was used effectively by Campbell et al (41) in a study designed 
to develop and evaluate behaviorally based rating scales. An interesting finding of
the study is that a modified scaled expectations procedure, which is a performance 
rather than an effectiveness measure, yields less method variance, less halo error, 
and less leniency error than a summated ratings technique which is not behav- 
iorally anchored. 


SPECIFIC SELECTION TECHNIQUES AND AREAS 


This section deals primarily with four areas of current and continuing interest: 
assessment centers, the interview, biodata and the application form, and the 
prediction of proneness to theft among employment applicants. 

There was the usual spate of validity studies, usually of single jobs by single 
predictors against single criteria. Above the mine run, Anderson & Roush (7)
correlated both judgmental and output criteria against General Aptitude Test 
Battery (GATB) scores for 76 coil winders and found that the published strategies
for GATB use were ineffective in differentiating between good and poor workers. 

Corlett et al (57), on a very small sample, found the O’Connor Finger Dexterity 
Test and the Purdue Pegboard unreliable over time. Cory (58) found a Navy Maze 
Test superior to the Porteus Mazes for assessing Category IV personnel and 
significantly different regression lines for Black and non-Black Naval personnel. 

A few positive findings were reported for typical performance measures. The 
Eysenck Personality Inventory seems to contribute to the prediction of pass/fail in 
pilot training (102). The Navy Vocational Interest Inventory, on the basis of retest 
scores for Navy enlisted men obtained as civilians 6 years later, showed reliability 
and validity paralleling that obtained for the Strong VIB (104). Schwab & Packard 
(152) report that job applicants, as compared with employees, did not actually 
seem to try to distort their responses on the Gordon Personal Profile and Personal 
Inventory, calling into question the applicability of distortion research to the 
actual employment situation. 


Assessment Centers 


The growth of assessment centers and the development of materials for assessment 
center use continued strongly over the period surveyed. Blumenfeld (28) describes 
the history and operation of assessment centers and discusses reasons for the 
research findings that show assessment center procedures are likely to yield high- 
er validity than usual employment office procedures. Moses (119) describes the 
AT&T program, reporting a high degree of relationship between predictions made 
at an assessment center and subsequent progress in management, and also (120) 
an abbreviated Early Identification Assessment program designed to evaluate 
substantial numbers of short service employees. The results obtained are highly 
predictive of those yielded by the company’s more extensive Personnel Assess- 
ment Center Program. The economies effected are considerable. Dodd (65) re- 
ported high correlations between assessment center predictions and later 
management progress at IBM, and Huck (98) compared the external (on-the-job 
measures) and internal (predictor to assessment ratings) validities in assessment 
centers. Prusa (138) discusses the problems of the assessor in the context of a more 
clinical approach than is found in American practice, and the kind of training 
likely to improve the assessor’s accuracy. In a provocative and argumentative 
treatment of the assessment of lives, Dailey (62) attacks the “... old objective 
methods [that] have failed us morally and scientifically” and argues for a human- 
istic model involving a human encounter between two equals. Assessment would 
be based upon the natural history of the person. Assessment center managers and 
assessors could read the volume with profit, if only to learn more about ways of 
eliciting and understanding life history data. 


The Interview 


Interest in the interview has continued unabated, but it would be optimistic to 
conclude that significant advances have been made. Fear (74) published a revision 
of his 1958 volume on the evaluation interview. Notable particularly for the 
structure, directions, and guides for the interviewer, the techniques he advocates, 
if faithfully followed, will yield substantial data. His trait approach to analyzing 
applicants, however, and his profiles of traits and characteristics for a variety of 
industrial jobs seem to be based primarily on intuition and not particularly 
supported by data. The general excellence of technique could well be dissipated by 
unwarranted conclusions based upon a rather outmoded static trait analysis. 
Peskin (129) goes much further in emphasizing not data collection but complex 
inferences about personality dynamics that most interview research indicates is 
the least valid outcome of the interview process. Carlson et al (43) showed that 
selection interviews can be improved if they are carefully structured and inter-
viewers are trained. Schuh (151) found that training can significantly affect the 
interviewer's perception of the applicant. 

Corroborating a large number of earlier studies, a wide variety of factors were 
found to affect the interviewer’s decision process. Baskett (23) found that the 
interviewer takes into account both cues as to the interviewee’s competency and 
similarity of attitude between the interviewer and the interviewee; in an interview 
simulation, both tended to influence the interviewer’s hiring recommendation and 
hiring salary offered. Hollmann (95) reexamined interviewer errors in processing 
negative and positive information. He concluded that while both negative and 
positive information is processed with equivalent accuracy, insufficient weight is 
placed on positive information. This conclusion is already well documented, 
although Hollmann criticizes methodological shortcomings of previous studies. 
The amount of job information available also influences interviewers’ decisions, 
and particularly leads to higher inter-interviewer agreement. Langdale & Weitz 
(103) compared interrater reliability for interviewers who had an extensive job 
description with those who had only a job title. The former, presumably knowing 
what to look for in candidates, had far higher interrater reliability than the latter. 
Ledvinka (105) confirmed what a number of other studies have also shown: that 
there is an interaction between the race of the interviewer and the race of the 
interviewee. In this study, the reason given by black job seekers for leaving 
previous job was compared for black and white state employment service inter- 
viewers. Black interviewers elicited more reasons that involved rejection of the job 
by the worker or rejection of the worker by the employer than did white inter- 
viewers. Valenzi & Andrews (167) documented inter-interviewer disagreement 
in rating a group of 243 secretarial applicants for whom each of four interviewers 
received substantially a standard set of information cues. In part at least the rating 
differences could be attributed to differences in cue utilization. Wexley et al (173) 
reexamined the impact of contrast effects, and found them strongest when an 
average applicant was preceded by two highly qualified applicants or two low 
suitability applicants. It is overly simple, however, to generalize from the results of 
simulated interview situations with students to an actual personnel office 
operation. Given a set of open requisitions and line departments pressing for 
replacements, the pressure to fill jobs tends to be momentarily alleviated if a 
few obviously suitable people can be hired, but exacerbated if the office faces a 
string of obviously unsuitable candidates. 

The employment interview is not only a data acquisition process. It is also an 
informational and public relations process, designed to influence suitable candi- 
dates to accept jobs. The interviewer and the interviewing organization can indeed 
take positive steps to achieve this end. Wanous (170) found that a realistic pre- 
sentation of job content and job expectations contributes to survival in the or- 
ganization. Ivancevich & Donnelly (101) found that reinforcement of a job offer by 
subsequent communication significantly reduced the rate at which college seniors 
rescinded job offer acceptances. 

The interview continues to be the principal selection procedure used in business 
and industry. As the standards for use of more elaborate psychometric devices are 
raised by government regulation, it is likely that the role of the interview will 
increase even more. The technology of increasing interviewer effectiveness is 
reasonably well known—it involves both training and the imposition of structure 
via guides and rating scales—but there is little cause for optimism that precisely 
those organizations that give up on psychometric approaches to selection because 
the cost of validation is too high will be motivated to do much about improving 
their interview process either. 


Biodata and Reference Checks 


When Glennon, Albright & Owens published the Catalog of Life History Items 
(88), biodata, standardized and validated, seemed to give great promise for selec- 
tion. The last 3 years, however, saw little published on the use of biodata in 
employment selection. One possible index of dwindling interest was the expe- 
rience of the authors of the Catalog, who invited reports of validation studies of the 
items: not one study has been received. Asher (17) expresses reservations about 
the possibility of improving the biographical item. Roach (142) reported a sub- 
stantial loss in predictive power of a previously cross-validated weighted applicant 
blank for predicting tenure of clerical employees, attributing the loss in efficiency 
to changes in labor market conditions, manpower needs, and personnel policies. 
Federal and state guidelines on testing also raise questions about biodata. In the 
first place, it may be difficult to establish job-relatedness for many items frequently 
used; in the second, the problem of adverse impact for minority group applicants 
may seriously reduce the usefulness of many items which appear to be culture- 
specific. 

In a somewhat related kind of selection device, Carroll & Nash (44, 122) report 
on a forced-choice format reference check for clerical hires, which, after correction 
for the effects of five moderator variables (sex, longevity, job congruity, nation- 
ality, and race) yielded validities for the best subsets of items of .64 and .56 against 
a supervisory performance rating criterion. The reference check does seem to be a 
device for which the forced-choice technique is particularly appropriate. 


The Detection of Proneness to Theft 


In business and industries in which employees have largely unsupervised respon- 
sibility for money or merchandise (e.g. armored truck drivers, vending machine 
and bakery route drivers, grocery clerks, department store personnel), employee 
theft can be a serious source of loss or “shrinkage.” Where security needs are high, 
many employers screen employment applicants by means of a polygraph (“lie 
detector”) interview, but an increasing number of states have banned the pre- 
employment use of the polygraph. The Reid Report (140) is a paper and pencil test 
based upon the notion that if the questions are asked properly, applicants will 
accurately reveal their attitudes toward honesty and their own defalcations, if any. 
A series of studies by Ash (11, 12, 14) led to an instrument validated (r = .6) 
against verified theft history. Using a moderate cutoff score, a base-rate expec- 
tancy of dishonest employee behavior of 40 percent is reduced to 15 percent 
among those selected. In a somewhat related area, Weiler & Weinstein (171) 
demonstrated significant differences in interview behavior as between qualified 
persons (honest performers) and unqualified persons (dishonest performers). The 
old Hartshorne & May findings that honesty is situation specific may be due for 
qualification or revision. 
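
The way a valid predictor lowers the expected rate of dishonest behavior among those hired can be sketched as follows. This is an illustration only, not the procedure used in the studies cited: it assumes predictor and criterion are bivariate normal, and the function name, the selection ratios, and the integration settings are hypothetical. It does not attempt to reproduce the 15 percent figure, since the cutoff actually used is not stated here.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def dishonest_rate_among_selected(validity, base_rate, selection_ratio, steps=4000):
    """P(dishonest | hired) under an assumed bivariate-normal model.

    validity        -- correlation between risk score and latent dishonesty (|r| < 1)
    base_rate       -- share of all applicants who would be dishonest
    selection_ratio -- share of applicants hired (those with the lowest risk scores)
    """
    # Latent criterion threshold: the top `base_rate` share counts as dishonest.
    y_cut = STD_NORMAL.inv_cdf(1.0 - base_rate)
    # Predictor cutoff: applicants scoring below it are hired.
    x_cut = STD_NORMAL.inv_cdf(selection_ratio)
    resid_sd = sqrt(1.0 - validity ** 2)

    # Midpoint-rule integration of P(dishonest | x) over the hired region.
    lower = -8.0
    width = (x_cut - lower) / steps
    joint = 0.0
    for i in range(steps):
        x = lower + (i + 0.5) * width
        p_dishonest_given_x = 1.0 - STD_NORMAL.cdf((y_cut - validity * x) / resid_sd)
        joint += STD_NORMAL.pdf(x) * p_dishonest_given_x * width
    return joint / selection_ratio


# Validity and base rate as reported in the text; selection ratios are assumed.
for ratio in (0.7, 0.5, 0.3):
    rate = dishonest_rate_among_selected(0.6, 0.40, ratio)
    print(f"selection ratio {ratio:.1f}: expected dishonest rate {rate:.3f}")
```

With zero validity the function returns the base rate itself, and with positive validity the expected rate falls as the cutoff becomes stricter. That is the general pattern behind the reported reduction, although the exact figures depend on the cutoff and on how well the normal model fits.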


CONCLUSIONS 


Federal and state regulation has become a permanent aspect of personnel selec- 
tion and placement, and litigation over allegedly discriminatory selection proce- 
dures is predictably going to increase. Several results seem likely. First, small 
organizations unable or unwilling to establish the validity of tests they use will give 
up the uncritical off-the-shelf use of tests—indeed, this trend is documented in 
several unpublished surveys. Second, the development of selection procedures on 
an industry-wide basis, combining small organizations and taking advantage of 
the economies of scale to apply advanced statistical techniques and assemble large 
samples, will come increasingly to the fore. This process may be noted currently in 
the banking industry, in the bus transportation industry, in police patrolman 
selection, and in national boards for professional accreditation. Third, painful 
though the medicine seems to be to some, legal pressure seems to be bringing to 
professional practice the teachings of the science in respect to standards for 
selection and placement. 

Although there is a small continuous flow of literature on “systems approaches” 
to selection, the reviewers find little evidence that systems rhetoric will be trans- 
lated into useful practice in the near future. We are more likely to see proliferation 
and application of alternatives to the old standby paper-and-pencil multiple 
choice instrument, such as Job Element examining, man-job matching, job- 
derived work sample and performance testing, and structured exercises like those 
now in use in assessment centers. We also expect that the assessment center itself, 
as a locus and procedure for assembling selection data, will be more widely repli- 
cated. 

Above all, however, we do not believe that the next review will find any greater 
increment of change than we have over the last—and that was small. As for the 
longer view, the perspective of the last half century commands cautious optimism 
for small improvements over the next half century. But the mind of man runneth 
not that far. 


This review completes a three-part series in the Annual Review of Psychology 
under the new design for intervention in psychopathology. The first in this series 
appeared in 1973 under the heading Social and Community Interventions, and the 
second appeared in 1974 entitled Intervention Techniques: Small Groups. The 
present review on individual psychotherapy covers 1971 through 1973 and is the 
responsibility of Bergin. The material on behavior therapy covers 1970 through 
1973 and is the responsibility of Suinn. 


STATUS OF ACTIVITY IN THE FIELD 


Previous reviewers have noted the marked increase in publications focusing on 
behavior therapy, and clearly this movement has an identity and an energy level in 
a class by itself. For this reason, it seemed appropriate in this review to devote 
about half of the space to an account of behavior therapy research, with the 
remainder being devoted to research on all other forms of individual therapy. If 
one takes as a standard the broad spectrum journals such as The Journal of 
Abnormal Psychology and The Journal of Consulting and Clinical Psychology, or 
the major psychiatric journals, the frequency of journal articles in these two broad 
areas is about equal. 


OUTCOME 


In outcome studies, more attention is being paid to specifying the nature of 
therapeutic ingredients as they relate to changes in specific problems or symptoms. 


1 The authors are indebted to the following persons who helped in a massive enterprise: 
Glenn Brasington, Philip Berghausen, Diana Berghausen, Jane Bever, Lori Banta, Sharee 
Bunker, John Collins, Davilyn Dalton, James Holley, Lois Huebner, Stephen Klein, Pam 
Knight, Joanne Moran, Paul Russell. This project was supported in part by a Research 
Division Faculty Research Fellowship, Brigham Young University. 
In keeping with this trend, there has been a maturing of comparative studies which 
contrast two or more techniques and which may also consider commonalities in 
divergent approaches to treatment. 


Comparative Studies 


THE TEMPLE STUDY Conducted by Sloane et al (316), this is probably the best 
comparative outcome study done to date. It involved more than 90 outpatients 
seen at the Temple University Health Sciences Center who were typical clinic 
outpatients generally similar to those described by Wolpe as “complex neurotic 
states.” They were assigned to behavior therapy or to short-term analytically 
oriented psychotherapy or to a minimal treatment waiting list group. The groups 
receiving each treatment were matched with respect to sex and severity of symp- 
toms but otherwise were randomly assigned. The three behavior therapists and the 
three analysts were well trained and highly experienced. Pretesting and posttesting 
utilized standard psychological tests, the target symptoms technique, a standard- 
ized interview, reports by informants who had known the patients for an average 
of 12 years, and ratings by the therapist, the client, and an independent assessor. In 
addition, tape recordings were taken of each fifth interview for the purpose of 
describing typical behaviors of the therapists. 

At the 4-month posttesting, all three groups, including the wait list group, had 
significantly improved on target symptoms, but the behavior therapy (BT) and 
psychoanalytic therapy (PT) groups had improved significantly more than the wait 
(W) group. There were, however, no differences between behavior therapy and 
psychoanalytic therapy on any of the target symptoms as rated by an independent 
assessor whose knowledge of the design was kept as blind as possible. On estimates 
of more general functioning such as at work and in social situations, the groups 
also improved but did not differ from each other in amount of improvement. In 
rating global outcome, the independent assessor rated 80% of the BT and the PT 
groups as improved, whereas only 48% of the W group was improved. On general 
adjustment, 93% of the BT group and 77% of the PT and W groups were rated as 
significantly improved. All three groups were found to have maintained their 
improvement after one year. The general trend was for improvement to continue 
and for the patients in the wait group gradually to approach or equal the therapy 
groups. This suggests that the main effect of therapy was to accelerate change 
rather than to produce more change than occurs without therapy in these cases. 
Comparative change between BT and PT varied across different criteria with a 
slight trend favoring BT. There was generally an absence of deterioration effects, 
which is predictable in light of the competence of the therapists. 

The analyses of tape recordings indicated striking similarities and differences 
between these two small groups of therapists. The behavior therapists were more 
active and directive and talked more. The two groups were equal on warmth, but 
the behavior therapists received higher ratings on empathy and congruence as 
measured by the Truax scales. Interestingly, the therapists did not differ in fre- 
quency of interpretations. Also of considerable interest was the fact that the Truax 
scores were unrelated to outcome, a more and more common finding which is 
discussed later in this review. 

The psychotherapists had better outcomes with patients who were less dis- 
turbed, whereas the behavior therapists did equally well with all levels of distur- 
bance. In addition, they tended to do particularly well with the more extroverted 
cases and those scoring higher on the MA, HY, and PD scales of the MMPI. This 
greater versatility for behavior therapy may have less to do with the BT techniques 
than it does with the fact that behavior therapists seemed to be more flexible than 
the psychotherapists and used a wider variety of techniques including psychoan- 
alytic, client-centered, and rational-emotive ones. This may explain why the be- 
havior therapists can handle a broader range of cases effectively. This study will 
undoubtedly become a landmark reference in the field and will be analyzed and 
debated for a long time. It is likely to be compared frequently to the Paul study 
(267) but is clearly superior in the sense of involving clinical cases representing a 
number of syndromes and treatment by experts in a natural setting. 


THE DILORETO STUDY Di Loreto (80) compared systematic desensitization, Ellis’ 
rational therapy, and Rogers’ client-centered therapy. The study involved 100 
college student volunteers who reported high interpersonal anxiety and a desire 
for treatment. Twenty were assigned to each treatment group and an equal 
number to a placebo therapy group and a no-contact group, with approximately 
half of the subjects being introverts and half extroverts. All groups except the 
no-contact group were seen for 11 hours of therapy in group sessions consisting of 
five persons per group. The therapists were advanced graduate students who had 
experience in and commitment to each of the three main orientations. Evaluation 
included a multiple test battery and behavior observation ratings. 

All three types of therapy were consistently superior to the control groups in 
outcome. Systematic desensitization was equally effective with introverts and 
extroverts, while the other treatments tended to be more effective with introverts. 
Although the study included elaborate experimental controls and 72 pages of 
statistical analyses, the results are somewhat limited by the nature of the subjects 
and the therapists. In addition, commentators representing the three schools of 
therapy each criticized the way in which their own therapy was conducted. 

This study, however, does support the findings of the Temple study in that 
major differences in outcome as a result of different techniques were not evident. 
Since the phenomena and processes involved were somewhat closer to the “real 
thing” than in the Paul study, this adds further weight to the notion that the Paul 
results were not relevant to real-life problems. 


OTHER COMPARATIVE STUDIES Luborsky’s (205) definitive review of more than 
100 comparative studies reveals little or no difference in the effects of a wide 
variety of therapy techniques and emphasizes the common components in them. 
Roback (293), in an overlapping review, concludes that there are no significant 
differences between insight therapy and behavior therapy outcomes. Allen (5) 
examined the comparative effectiveness of desensitization and “study counseling” 
techniques in reducing test anxiety and increasing academic performance. The 
techniques were not different in outcome and were not better than a placebo. 
Patterson, Levene & Breger (266) compared outcomes of psychoanalytic and 
behavioral methods as taught to mental health trainees and found no differences 
between therapies at 3 months follow-up. A subsequent report (188) indicated 
similar results with a slight trend in favor of behavior therapy. 


PSYCHOTHERAPY VERSUS BEHAVIOR THERAPY The foregoing accounts suggest 
that the supposed major differences between the behavioral and traditional insight 
therapies are not as great as was once believed, either in outcome or process. This 
is particularly true for the broad cross-section of typical outpatients and is proba- 
bly less true with respect to circumscribed disorders, such as special kinds of 
phobias, sexual disorders, etc. The earlier zeal in favor of behavior therapy with a 
broad class of neuroses was probably based upon misleading studies with mini- 
mally disturbed volunteers. 

Behavioral methods are loaded with affective and cognitive variables of the 
same kind that dominate other therapies, and the esoteric techniques employed by 
behavioral therapists are not always crucial in producing positive outcomes. It is 
interesting that when asked what helped them the most, the patients in the Temple 
study identified the interpersonal characteristics of the therapists. Such a view is 
strongly supported in a separate study by Ryan & Gizynski (300). Marmor (217) 
provides a sophisticated analysis of the differences and commonalities between 
the therapies, along with a penetrating critique of the weaknesses in the behavioral 
position. He believes that BT makes excessive claims and is based upon many of 
the elements of traditional psychotherapy. 

Despite obvious successes, behavior therapy is not yet a cure-all, as is indicated 
by Lazarus’ account of where behavior therapists take their troubles (182). Of 20 
behavior therapists surveyed who were in personal therapy for themselves, 10 
were in psychoanalytic therapy, 5 in Gestalt therapy, 3 in bioenergetics, 4 in 
existential therapy, 1 in group dynamics, and not one in behavior therapy! In 
explaining their choices, different therapists made the following remarks: “I have 
decided to give the opposition a fair try.” “My therapist is a beautiful human being 
and that means more to me than his theoretical orientation.” “Let’s face it, if you 
can afford it in terms of time and money, psychoanalysis is still the treatment of 
choice” (182, p. 349). 


Outcomes with Specific Techniques and Patients 


THE MENNINGER STUDY Summarizing the Menninger study over the 20-year 
period since its inception in 1954 and the publications of results in recent years is a 
difficult task. Among the reports that have been published, two recent ones are of 
particular interest (Kernberg et al 164; Voth & Orth 349). The study included 42 
adult outpatients and inpatients. The objective of the project was to investigate the 
relationship of personality change to patient, treatment, and situational variables. 
The patients were seen in either psychoanalysis or psychoanalytically oriented 
psychotherapy by experienced therapists. Those in analysis were seen for an 
average of 835 hours and those treated by psychotherapy for 289 hours. Im-
provement in life adjustment appeared to occur on a number of concrete indica-
tors such as work status and social functioning, although a number did not seem to 
improve on these criteria and some seemed to be worse. On the Health-Sickness 
Rating Scale, which was devised specifically for this study, the majority of patients 
improved although there was no difference in improvement between those in 
psychoanalysis and those in analytic psychotherapy. The absolute amount of 
change from pre- to posttesting was apparently significant, though ordinary out- 
come data are not provided. Voth & Orth (349) provide evidence of a number of 
failures to produce therapeutic improvement and also inadequate attempts to 
prevent further deterioration. 

This naturalistic study provides an enormous number of correlations. These 
indicate that: a high level of initial ego strength is correlated with positive change 
regardless of type of therapy, whether psychoanalysis, expressive psychotherapy, 
expressive-supportive psychotherapy, or only supportive psychotherapy. The 
feature of ego strength most predictive of outcome was the quality of interpersonal 
relationships. While high ego strength provides good prognosis for all types of 
treatment, psychoanalysis appears to produce the most improvement in such 
cases. Patients with low ego strength, when treated by therapists with high skill, 
improved significantly more when focus on the transference was high. Also, 
patients with ego weakness improve better when a special treatment is provided, 
namely supportive-expressive treatment. Patients with a low quality of interper- 
sonal relationships tended to do poorly regardless of type of therapy. Patients with 
severe ego weakness did better under an expressive approach which focused on 
the here and now of the transference and structuring of their life outside of 
treatment. Another significant finding was that the initial characteristics of the 
environment had no significant value as predictors of outcome. With respect to 
therapist skill it was found that highly skilled therapists succeed regardless of type 
of therapy, whereas the less skilled therapists do better with expressive therapy, 
perhaps because expressive therapy implies a standardized technique and does not 
require as much flexibility and involvement by the therapist as supportive therapy. 
The study also indicated that motivation for change and psychological minded- 
ness are not necessary prerequisites for analytically oriented therapy and that the 
“preparation” of the patient can be achieved within the early stages of treatment. 

A summary of the Kernberg (162) report has been reprinted in the International 
Journal of Psychiatry with critiques by Malan (212), Greenson (122), and May 
(224), and a reply by Kernberg (163). Criticisms of the study include: the fact of a 
highly selected patient population which biases outcome in favor of the therapy; 
the testing of an enormous number of hypotheses on a small number of patients; 
the strong reliance upon factor analyses for deducing variables for study which has 
been criticized by some observers as a “garbage-in and garbage-out process”; the 
fact that the ratings of therapist skill were not independent of knowledge of the 
therapists and their outcomes; and the fact that it is difficult to draw any strong 
conclusions about the general effectiveness of the treatment in light of the absence 
of an experimental design which includes control groups, etc. Overall, the study is 
commendable in its adherence to the clinical phenomena and its adaptation of 
clinical judgments to the needs of research. The correlation data have implications 
at least for the practice of psychoanalysis and tend to support the notion that 
appropriate transference resolution enhances outcome. The results, however, are 
not generally dramatic and cannot be shown to be better than the results of 
placebo therapy or other therapies; therefore, the findings of this project are 
interesting but have not yet been demonstrated to have a profundity commen- 
surate with the enormous investment of time, energy, and funds over a 20-year 
period. 


THE MALAN STUDIES Malan (213) and colleagues at the Tavistock Clinic in 
London have conducted a series of outcome studies of psychoanalytically oriented 
therapy focusing upon assessing change via dynamic criteria which depend upon 
the skilled judgments of experienced clinicians. As in the Menninger study, the 
thesis is that meaningful results having implications for practice can only be 
obtained by remaining close to the complexity of live therapy and judging changes 
from a clear theoretical and clinical perspective. In this manner, criteria of positive 
change can be individually established for each case and then ratings can be made 
on a simple scale of improvement. 

Their results indicate that both psychoanalysis and brief psychoanalytic therapy 
can be highly effective for a small minority of patients, although the characteristics 
of these specific cases are not yet entirely clear except for the importance of 
motivation for insight. Perhaps the most important finding, and one that has been 
replicated, is that the frequency with which the therapist makes transference- 
parent interpretations (but not other transference interpretations) is positively 
correlated with outcome. This holds even for brief therapy, as it did for supportive 
therapy in the Menninger study. Together the two studies provide renewed sup- 
port for the importance of transference and its resolution in personality change, 
but it is still unclear when or with whom this is especially effective since many 
patients definitely improve without going through this process. 

Evidence of the latter is shown by careful analyses of dynamically sound 
changes in untreated cases and in persons seen for a single interview. The value of 
the psychodynamic assessment criteria is seen in the fact that 69% of untreated 
cases appear to have improved on traditional symptomatic criteria, whereas, when 
examined in terms of dynamic assessment, only 26% had improved—a valuable 
lesson for Professor Eysenck here! 


THE ROGERS-TRUAX-CARKHUFF THERAPIST FACILITATIVE CONDITIONS One of the 
most popular areas of research continues to be the analysis of empathy, warmth, 
and therapist genuineness or congruence and other therapist behaviors as they 
relate to process and outcome. In recent years, a number of studies have induced 
skepticism concerning the potency of these variables except in highly specific, 
client-centered type conditions. Among recently completed studies, two indicate 
positive support for the notion of a relationship between empathy and outcome 
(256, 348). Several others failed to replicate this type of finding, including the 
study done at Temple and a major study (248) done on 30 patients with expe- 
rienced therapists. Similar findings have been obtained by Kurtz & Grummon 
(174), Winborn & Rowe (364), and apparently Di Loreto (80), although his 
content analysis does not precisely fit the paradigm in question. It could be that a 
number of the negative findings on empathy etc, are due to the fact that the 
therapists being studied were not employing a strictly client-centered technique 
and thus the Truax-type scales were not applicable even though the original claim 
was that these variables cut across schools. Perhaps these conditions are vital to 
positive change, or are at least correlated with it, but are not being measured 
appropriately. For example, in a number of studies showing no relationship 
between tape-rated empathy and outcome there were quite significant correlations 
between patient perceived empathy and outcome, usually as measured by the 
Barrett-Lennard Relationship Inventory. 

Many critical questions concerning the nature of these therapeutic conditions, 
as rated by the currently popular scales, have been raised by Chinsky & Rappaport 
(62), who carried on a debate with Truax (282, 346). They show a number of 
psychometric defects in the nature of the scales and question the validity of the 
outcome studies. They seem to have gotten the better of the argument despite 
some support of Truax’s² position by Bozarth & Krauft (51). Mintz & Luborsky 
(247), in their major process-outcome study, measured a large variety of therapist 
behaviors and in a factor analysis found a first factor which they entitled “optimal 
empathic relationship.” This variable included warmth, empathy, skill, relaxed 
style, spontaneity, sensitivity, and perceptiveness. Other factors included “direc- 
tive mode,” “patient health-distress,” and “interpretive mode with receptive pa- 
tient.” In analyzing 4 min segments as opposed to whole sessions, they discovered 
similar variables with much overlap in correlation except for the empathy vari- 
able, indicating that judgments of therapist empathy based on brief segments 
cannot be generalized to whole sessions! This finding is given strong support by 
Gurman’s study (126) in which three high-facilitative and three low-facilitative 
therapists were found to be extremely variable in empathy etc, both within hours 
and between hours. These findings may invalidate much of this program of 
research. Interestingly, Bachrach, Mintz & Luborsky (12) found that the Truax 
empathy scale correlated nearly perfectly with two other empathy scales, includ- 
ing Raskin’s. Since the Raskin scale is easier to rate and is more relevant to a wide 
range of empathy, perhaps it should be used more widely. 

One of the most stunning reports in this area has been provided by Mitchell et al 
(249) in a study of 75 expert therapists with 120 clients. Generally there was no 
relationship of empathy or warmth and outcome, even when the most and least 
changed clients were selected. The findings indicate that between 32 and 71% 
improved, depending on the criterion. 

² We note here with sadness the regrettable and untimely death of Dr. Truax in December 
1973. 


OUTCOMES OF OTHER METHODS New therapy methods continue to be invented 
and evaluated, and some older methods that have received little evaluation are 
beginning to be tested more systematically. Trexler & Karst (345) tested rational- 
emotive therapy versus placebo and no-treatment effects on public speaking 
anxiety in 33 students in a public speaking class. Like most studies of outcome with 
a single technique, there was at least modest support for the effects of the therapy 
on some criteria compared with the control groups, but there was no comparison 
with any other form of therapy. The problems were mild and changes were more 
evident on an irrational beliefs test and self-rating of confidence as a public 
speaker than on a behavioral check list or an anxiety scale. Fransella (100) adapted 
George Kelly’s theory and technique to the therapy of 20 adult stutterers and 
found that a majority improved, with 10 out of 16 tested at termination having 
reduced their dysfluencies by 50%. Of those contacted at follow-up, about half had 
continued to improve. 

In an intriguing application of therapy to Harlow’s neurotic monkeys, Suomi, 
Harlow & McKinney (340) provided young monkey psychiatrists for the disturbed 
ones. These monkey therapists engaged in clinging and play behavior with the 
isolate monkeys who had been separated from their mothers and were 6 months of 
age at the time of treatment. At 1 year the isolates had greatly improved, and at 2 
years there was virtually complete recovery in monkeys whose defects were con- 
sidered irreversible. The authors and inventors of “monkey psychiatry” offer the 
following thought: “In conclusion, we are all aware of the existence of some 
therapists who seem inhuman. We find it refreshing to report the discovery of 
nonhumans who can be therapists” (340, p. 932). 

Other studies of interest include a well-advertised but artifact-ridden attempt 
to evaluate primal therapy (155); an extensive review (317) of outcome in brief 
psychotherapies; and a review (125) of the use of nonprofessionals as therapeutic 
agents. 

Feighner (94) attempted to evaluate the Russian method of using electrosleep 
therapy to treat psychiatric disorders. Although about one third of the patients 
treated appeared to improve, nearly all relapsed within a month, and four of the 
more depressed patients became significantly worse. 


STUDIES OF THERAPEUTIC CHANGE IN SCHIZOPHRENIA Outcome studies with 
schizophrenics continue to show mixed results (see 85, 102, 253). Much of the 
evidence seems to be on the negative side, but an exception is the study by Karon 
& VandenBos (156), who found a significant effect of individual psychoanalytically 
oriented therapy at termination and at follow-up 2 years later. They also found 
that patients of experienced therapists did better in the long run. 

An interesting study that has stimulated some controversy was done by Rosen- 
han (295) and colleagues. Eight sane people gained secret admission to 12 hospi- 
tals of diverse quality and orientation. All reported to the admitting person that 
they had heard voices and then, while falsifying their identities, gave accurate 
personal case histories that were not seriously pathological in any way. All were 
admitted and immediately after admission stopped simulating any symptoms and 
attempted to prove they were sane. All were eventually released but with a 
diagnosis of “schizophrenia in remission.” Rosenhan appears to think that this is 
exceptional, but it seems to be a perfectly natural diagnosis for people who have 
reported having hallucinations. 

In a related study, staff members were warned that during a 3-month period one 
or more pseudopatients would attempt to be admitted. During this period 193 
patients were admitted, but no pseudopatient actually tried to get in. Nevertheless, 41 
patients were believed, with high confidence by at least one staff member, to be 
pseudopatients, and 19 were suspected by two independent evaluators. Rosenhan 
presents this as evidence that it is impossible to distinguish the sane from the 
insane, yet the small, reliable error rate of 10% could well be an artifact of the 
instructions, which gave the staff a set to watch for simulating. It may be that 
Rosenhan is correct in declaring that “the mentally ill are society’s lepers,” but it is 
hard to believe that his provocative and overstated study provides unique evi- 
dence of this. One has the feeling that his attack on hospital staffs is not new and 
that what is new is not true. 
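
The error rates discussed above follow directly from the counts quoted from 
Rosenhan's second study. A minimal sketch of the arithmetic, using only the 
figures given in the text (the variable and function names are illustrative): 

```python
# Counts quoted in the text for Rosenhan's follow-up study.
ADMITTED = 193                   # patients admitted during the 3-month period
FLAGGED_BY_ONE_STAFF = 41        # judged pseudopatients by at least one staff member
FLAGGED_BY_TWO_EVALUATORS = 19   # suspected by two independent evaluators


def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


# No pseudopatient was actually sent, so every flagged case is a false alarm.
rate_one_staff = percent(FLAGGED_BY_ONE_STAFF, ADMITTED)
rate_two_evaluators = percent(FLAGGED_BY_TWO_EVALUATORS, ADMITTED)

print(f"flagged by at least one staff member: {rate_one_staff:.1f}%")
print(f"flagged by two independent evaluators: {rate_two_evaluators:.1f}%")
```

The 10% figure cited above corresponds to the stricter criterion of agreement 
between two evaluators; the single-rater criterion gives a rate about twice as high. 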


Persisting Issues 


The question of whether therapy has any effect continues to provoke strong 
opinions. Rachman’s book on The Effects of Psychotherapy (276) and a digest 
thereof (279) reassert the claims of Eysenck regarding (a) the ineffectiveness of 
psychoanalysis and other traditional psychotherapies; (b) a two-thirds spontaneous 
remission rate for neuroses; and (c) the superiority of behavioral methods of 
treatment. Some support is given to Truax and Carkhuff’s idea of utilizing 
Rogerian therapist behavior as reinforcers. In addition, Rachman attacks evidence 
for lower spontaneous remission rates and for improvement and deterioration 
effects presented by Bergin, Truax, and others, viewing these findings as threats to 
the antipsychotherapy, probehavior therapy position. May (223) also criticizes the 
evidence for deterioration effects. 


DETERIORATION VERSUS IMPROVEMENT Reviews of this same literature have been 
produced by Malan (211) and Frank (99), both of whom conclude that the 
cumulative weight of evidence for deterioration (and the opposite improvement 
effects) is substantial despite the fact that individual studies may be open to 
criticism. Additional evidence appeared in a truly monumental study of encounter 
groups by Lieberman, Yalom & Miles (191), in which 10 to 19% of a large number 
of participants were considered to be worse as a result of therapy, depending upon 
the criteria used, and this was significantly greater than any “spontaneous” nega- 
tive changes occurring among controls. Positive changes resulting from these brief 
but intense methods were not as great as usually occur in individual psychotherapy 
nor did they last as long. Using a cumulative change index, the authors estimate 
that 1/3 changed positively, 1/3 did not change, and 1/3 manifested negative 
effects, if dropping out is counted as negative. Gurman’s (127) thorough review of 
marital therapy outcome revealed 5 to 17% deterioration in three studies and none 
in 8 others. Across a total of 15 studies, he found an improvement rate of 66%. The 
most striking new data are presented in a definitive report by Ricks (289), who 
studied the adult status of several disturbed adolescent boys who had been seen by 
two therapists in a major child guidance center. While the long-term outcomes of 
these two therapists were not different for less disturbed clients, there were striking 
differences in their therapeutic styles and outcomes with the more disturbed boys. 
The more successful therapist was labeled “Supershrink” by one of the boys, and 
Ricks retains this term in describing his techniques versus those of therapist B 
(whom we might call “Pseudoshrink”), most of whose cases became 
schizophrenic. 

The weight of evidence and the views of reviewers (31, 99, 127, 211, 347) support 
the notion of significant improvement and deterioration effects in psychotherapy, 
with improvement rates averaging about 67% and deterioration rates about 10%. 
However, these averages obscure wide variations in figures. It should be noted 
here that deterioration resulting from behavior therapies has not been reported, 
but generally it has not been looked for. 


SPONTANEOUS REMISSION Malan (211), in his thorough and scholarly review of 
outcome research over the past 20 years, agreed with Bergin (31) that the 
Eysenck-Rachman position on spontaneous remission is weak, that Eysenck’s 
own figures do not add up to a two-thirds spontaneous change rate, and that true 
rates are probably much lower than this. However, he describes Bergin’s alterna- 
tive figure of 30% as insecurely based and calls for properly controlled studies of 
the natural course of neurosis, a reasonable opinion in light of the quality of much 
of the evidence. Subotnik (333) reviews the relevant studies and concludes—sup- 
ported by his own study (334)—that spontaneous remission is small or nonexistent 
(see also 191). Following Eysenck’s mathematical formula for spontaneous re- 
mission, Gurman (127) noted that the rate for marital therapy should be 16% after 
17.5 weeks, which was the average length of treatment, whereas the improvement 
rate for treated cases was 66%—quite a difference! Indeed, applying Eysenck’s own 
formula to most controlled outcome studies shows similarly dramatic differences 
between experimentals and controls, which proves the opposite of what he and 
Rachman claim. Therapy rarely lasts 2 years, which is the amount of time the 
formula requires to reach a 67% improvement rate. 
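
The formula itself is not reproduced in this review. The sketch below assumes 
the form usually attributed to Eysenck, X = 100(1 - 10^(-0.00435 N)), where X is 
the percentage remitted and N the number of weeks elapsed; the constant 0.00435 
is an assumption drawn from that commonly cited form, not from the text above. 
Under this assumption the function gives roughly 16% at 17.5 weeks, which agrees 
with the figure Gurman reports, and it approaches two thirds only after about 2 years. 

```python
import math

# Assumed weekly constant from the commonly cited form of Eysenck's formula;
# it is not stated in the review itself.
DECAY_PER_WEEK = 0.00435


def spontaneous_remission_pct(weeks: float) -> float:
    """Percentage predicted to remit without treatment after `weeks` weeks."""
    return 100.0 * (1.0 - 10.0 ** (-DECAY_PER_WEEK * weeks))


def weeks_to_reach(pct: float) -> float:
    """Weeks the assumed formula needs before `pct` percent have remitted."""
    return -math.log10(1.0 - pct / 100.0) / DECAY_PER_WEEK


predicted_at_treatment_end = spontaneous_remission_pct(17.5)  # average marital therapy length
predicted_at_two_years = spontaneous_remission_pct(104)
weeks_for_two_thirds = weeks_to_reach(67)

print(f"17.5 weeks: {predicted_at_treatment_end:.1f}% predicted vs 66% observed in treated cases")
print(f"104 weeks:  {predicted_at_two_years:.1f}%")
print(f"weeks needed to reach 67%: {weeks_for_two_thirds:.0f}")
```

The comparison is only as good as the assumed constant, but it shows why a 
treated improvement rate of 66% at about 4 months cannot be attributed to the 
remission curve alone. 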

This recalls the fact, however, that in numerous studies large differences 
between treated and untreated cases at termination tend to wash out over long- 
term follow-up. Conceivably Frank is correct in suggesting that in many cases the 
effect of therapy is to accelerate or intensify change rather than to create greater 
absolute change than would eventually occur through “natural” healing processes. 
The Hopkins group (189) have found this for the second time in another long-term 
follow-up study. This process of accelerating change may also account for the few 
cases where behavior therapy outcome appears better than that of other therapies, 
though this potential for making equal change in shorter time (i.e. efficiency) 
should not be ignored. 


GENERAL OUTCOME ANALYSES Luborsky et al (203), in a thorough and penetrat- 
ing review of 166 outcome studies, evaluated the factors that correlate with 
positive change. They found that most prognostic studies deal with patient factors. 
Initially sicker patients do not improve as much as the initially healthier. Schizoid 
trends were especially negative prognostic indicators. Motivation, anxiety, edu- 
cational and social assets, and experiencing levels in early sessions all appeared as 
strong correlates of outcome. Therapist correlates included experience level, atti- 
tudes and interests, empathy, and similarity of patient and therapist. Technique 
factors included number of sessions, but not type of therapy. Unfortunately, 
nearly all of the change criteria in these studies were dominated by therapist 
ratings; thus the predictions are of mainly one kind of change and a particularly 
suspect one at that. Eysenck (89) criticized this paper and, repeating his mathe- 
matical formula, claimed that the changes studied by Luborsky et al were not 
shown to have anything to do with psychotherapy. Luborsky (201) provided an apt 
reply, to which we might add that those 166 studies reveal far more change per unit 
time following treatment than Eysenck’s formula predicts for spontaneous 
changes. Eventually this dubious equation should be quietly buried. 

The Shore & Massimo study (312) is now in its tenth year of follow-up, showing 
continued improvement in the treated group and continuing deterioration in most 
of the control cases. This study provides evidence that untreated delinquent boys 
tend to deteriorate much more than most neurotics, suggesting that amount of 
spontaneous deterioration is probably symptom specific. Rachman and Eysenck 
seem now to agree that spontaneous remission rates also must be symptom specific 
and that general rates are inappropriate, even though they do not carry the logic of 
this position through to its normal conclusion in their writings. 

Several reviews evaluate further the general question of therapeutic effects and 
provide a number of stimulating suggestions (69, 173, 255, 307). In his compre- 
hensive review, Malan (211) concludes that (a) the evidence for the effectiveness of 
psychotherapy is now relatively strong; (b) dynamic psychotherapy is effective in 
psychosomatic conditions; and (c) dynamic psychotherapy with the ordinary run 
of neuroses and character disorders is weak, except for a small percentage of apt 
clients who have strong motivation for insight and the ego strength with which to 
work through a strong transference relationship to successful resolution. 


CRITERIA AND ASSESSMENT OF CHANGE Outcome measurement continues to at- 
tract interest and defy clarity. Garfield, Prager & Bergin (108) reaffirm the diffi- 
culty in obtaining consensus across multiple criteria and also find further support 
for the notion that clients and therapists tend to rate outcome as better than 
observers or other more objective methods do. Fiske (97) and Luborsky (200) offer 
criticism and wisdom on these points and are responded to by Garfield, Prager & 
Bergin (109). Berzins, Bednar & Severy (36) give a new perspective on this 
problem with an excellent study of 79 therapist-patient dyads evaluated on 22 
measures. Their elaborate statistical treatment reveals that canonical correlations 
are more appropriate than factor analysis for analyzing the amount of intersource 
consensus in diverse measures of therapy outcome; and they show that higher 
agreement is obtained in this way. Mintz (245) conducted an ingenious study in 
which contrived cases were judged for amount of change by experts. Better 
prognosis patients were judged more improved even though the actual amount of 
change was equal across prognostic types. A subsequent experiment showed that 
judges value changes at higher (healthier) levels even though they are metrically 
smaller. Success ratings were thus influenced mainly by posttreatment level 
regardless of amount of change. He concludes that initially better individuals are 
rated as having improved more when global posttreatment outcome ratings are 
used; whereas the initially worse cases do better when gain scores are used, i.e. the 
more disturbed patients change more and the less disturbed end therapy at a 
healthier level, and judges value the latter more. 
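
The contrast between gain scores and posttreatment level can be made concrete 
with a small sketch. The two cases and their scores on a 0-100 health scale are 
hypothetical and are not taken from Mintz's data; they are chosen only to show 
how the two criteria can rank the same pair of patients in opposite order. 

```python
from dataclasses import dataclass


@dataclass
class Case:
    label: str
    pre: float   # hypothetical health rating before treatment (0-100)
    post: float  # hypothetical health rating after treatment (0-100)

    @property
    def gain(self) -> float:
        """Gain score: amount of change from pre- to posttreatment."""
        return self.post - self.pre


# Hypothetical illustration, not data from the study.
cases = [
    Case("initially more disturbed", pre=30, post=55),
    Case("initially less disturbed", pre=65, post=80),
]

best_by_gain = max(cases, key=lambda c: c.gain)   # criterion: amount of change
best_by_post = max(cases, key=lambda c: c.post)   # criterion: final level

print("largest gain score:           ", best_by_gain.label)
print("highest posttreatment level:  ", best_by_post.label)
```

With these made-up numbers the more disturbed case wins on gain (25 points 
versus 15) while the less disturbed case wins on final level, which is the 
pattern described above. 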


THERAPIST, CLIENT, AND PROCESS 
Therapist Variables 


THE A-B DIMENSION Considerable activity persists on the A-B variable, but this 
area is plagued by ambiguities and poor studies, many of them doctoral disserta- 
tions employing analog designs. It seems increasingly likely that the A-B measures 
will go the way of others, such as the Fascism (F) scale developed during the 
authoritarian personality studies. These deceptively simple and easily scored 
scales attract investigators, and numerous studies are then published which later 
seem trivial or misleading when the real complexity of the phenomena under 
study becomes clear (50). Some investigators, however, are convinced that the 
dimension is a viable one. Berzins has produced some elegant studies, though he 
now has doubts about the variable. In an analog using undergraduates (84), A 
therapists responded with greater immediacy to schizoid statements and Bs re- 
sponded similarly to neurotic statements. Studies of large numbers of profession- 
als and nonprofessionals seem to support the idea of personality differences 
between As and Bs (35, 37). As score higher than Bs on inhibition, harm-avoidance, 
and submissiveness and lower on sentience, dominance, openness and 
fun-loving. As tend also to score higher on feminine items and Bs higher on 
masculine ones. Berzins et al (39), applying the dimension to an actual treatment 
situation with 6 therapists and 57 clients in brief therapy with college students, 
found that Bs did better with neurotics than schizoids, while As unexpectedly did 
well with both patient types. 

Bednar & Mobley (29) had experienced therapists rate working with schizo- 
phrenics and neurotics on a semantic differential and other scales. Differences 
between As’ and Bs’ ratings of these patients in terms of esteem, warmth, and 
liking were minimal or nonexistent. Beutler et al (40) found that As had higher 
empathy scores with schizophrenics than neurotics, but Bs’ empathy did not differ 
for these different patients. No differences were noted on warmth and genuineness. 


THE TRUAX AND CARKHUFF MEASURES These methods have been discussed in the 
section on outcome. Empathy, warmth, etc appear to be trainable and learnable 
provided that a modicum of talent for interpersonal relating already exists. It is 
clearer now that these variables are not as prepotent as once believed; but their 
presence and influence are ubiquitous, even showing up strongly in behavior therapies. 


OTHER VARIABLES Henry, Sims & Spray (140) have produced the second volume 
of their comprehensive study of nearly 4000 practicing therapists in New York, 
Chicago, and Los Angeles. The authors present an endless array of data on the 
backgrounds, beliefs, and behaviors of this select group. Many will be interested in 
the fact that, on the average, therapists’ sexual behavior (as self-reported) is 
neither precocious nor extreme. Another unexpected finding is that two thirds of 
the therapists refer to their colleagues in terms of function and orientation rather 
than profession (psychoanalyst, psychiatrist, clinical psychologist, or social 
worker). When rating each other’s professions in terms of character styles, clinical 
psychologists are seen as aggressive, competitive, and overly intellectual; psy- 
chiatrists are viewed as rigid and authoritarian; and social workers are described 
as people-oriented and passive. When listing professions in terms of positive and 
negative traits, analysts are seen most positively and psychiatrists most negatively 
in terms of proportion of plus and minus traits. Clinical psychologists, however, 
are rated as having the largest absolute number of negative traits. Clinical psy- 
chologists are most negative toward psychiatrists and most positive toward 
analysts; whereas analysts are most negative toward clinical psychologists and 
seem to feel most threatened by them. In terms of love, therefore, the analysts and 
social workers seem to come off best. 

Of 3400 therapists responding, 71% considered themselves to have some form of 
psychoanalytic orientation as their dominant mode, with only 13% primarily 
group or community oriented. Behavior therapy preference was virtually nonex- 
istent. These data, of course, reflect one of the main defects of the report, 
namely, that the opinions were gathered in the middle 1960s and are probably 
unrepresentative of current views and practices. 


Client Characteristics and Process and Outcome 


Prager & Garfield (273) found little or no relationship between a variety of initial 
client measures and a variety of outcome indices. Critiquing this and other studies, 
Miller & Gross (241) argue that the relationship of initial disturbance to amount of 
change is curvilinear; thus, studies showing a positive relationship may have been 
sampled from a different portion of the disturbance distribution than studies 
showing a negative or zero relationship. Heilbrun (134) found that briefing clients 
on what to expect regarding counselor directiveness versus nondirectiveness 
yielded higher client satisfaction. Auerbach, Luborsky & Johnson (9) rated 47 
patients on 31 variables and found quality of outcome related to (a) general 
emotional health, (b) intellectual achievement, (c) acute depression, (d) attrac- 
tiveness, and (e) interviewer’s global estimate of change; however, the correlations 
were low (.27 to .37). Rickers-Ovsiankina et al (288) found client expectations of 
therapy to diverge markedly from those of therapists; and Berzins & Herron (38) 
provide a factor analysis of therapist expectations of client role behaviors. Wilkins 
(360), in a hard-hitting critique of expectancy studies, asserts that expectancy is a 
poorly defined construct and has not been proved to influence outcome. 
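
Miller & Gross's argument about curvilinearity can be illustrated numerically. 
The sketch below assumes an inverted-U relationship between initial disturbance 
and amount of change; the specific shape and the 0-10 scale are hypothetical, 
since the review does not specify the form of the curve. Samples drawn from 
different portions of the disturbance range then yield positive, negative, or 
zero correlations from the same underlying relationship. 

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation; returns 0.0 when the covariance is exactly zero."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y) if cov else 0.0


def change(disturbance):
    """Hypothetical inverted-U: most change at moderate initial disturbance."""
    return disturbance * (10 - disturbance)


full_range = list(range(0, 11))   # disturbance levels 0..10
mild_only = list(range(0, 5))     # a study sampling only milder cases
severe_only = list(range(6, 11))  # a study sampling only severe cases

r_mild = pearson_r(mild_only, [change(d) for d in mild_only])
r_severe = pearson_r(severe_only, [change(d) for d in severe_only])
r_full = pearson_r(full_range, [change(d) for d in full_range])

print(f"milder cases only: r = {r_mild:+.2f}")
print(f"severe cases only: r = {r_severe:+.2f}")
print(f"full range:        r = {r_full:+.2f}")
```

Under these assumptions the three samples disagree in sign even though every 
case follows the same curve, which is the reconciliation of conflicting 
findings that Miller & Gross propose. 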

Additional valuable data on client characteristics are provided in the second 
edition of Eysenck’s Handbook of Abnormal Psychology (90), which deals exten- 
sively with issues of classification, definition, and measurement important in 
establishing criteria of change. 


Race, Class, and Sex 


RACE Client and therapist race has become a prominent topic of recent research, 
with most studies focused on black and white participants. Banks (18) found that 
same race client-therapist pairs yielded the highest rapport ratings and degree of 
self-exploration in initial interviews. Empathy levels of therapists, however, were 
more strongly related to client ratings of rapport than racial similarity. Cross-race 
pairing seemed to work better with a white therapist and black client than vice 
versa. Bryson & Cody (53) obtained very similar results. 

W. E. Gardner (106) found that black students rated counselor’s facilitative 
conditions as higher when race was the same, but Cimbolic (63) failed to find such 
a result. Williams (361) found client racial differences to be unrelated to attitude 
toward psychotherapy, but blacks disclosed less than whites during interviews, 
regardless of therapist race. Krebs (172) found no evidence of negative effects 
of a white hospital staff on black patients compared with whites. Similarly, Winston, 
Pardes & Papernik (366) found that black inpatients did as well as or better than 
white inpatients with white therapists. L. H. Gardner (105) provides an overview 
of this general issue. 


CLASS Goldstein’s (118) psychotherapy for the poor is one of the most elaborate 
and empirically rigorous attempts to adapt therapy methods to the needs of less 
privileged clients. He adapts the language and methods to their style, leaning 
heavily upon modeling, role playing, and other behavioral techniques. He has 
assembled an impressive array of tapes and instructions for developing 
interpersonal skills among these clients and also provides an outline of a para- 
professional training program. Mayer & Timms (228) found that working class 
clients preferred therapists with better interpersonal skills and a capacity to intervene 
directly. They disliked the more aloof, nondirective, insight-oriented 
types. Strupp & Bloxom (331) have developed an effective role-induction film for 
preparing lower class patients for group psychotherapy. Warren & Rice (352) and 
Lorion (196) also provide suggestions for improving methods with these clients, 
while Lerner & Fiske (187) show that poor prognosis for lower class clients has 
more to do with therapist attitudes than client attributes. Also, Lerner (186) 
provides a penetrating critique of therapy in the ghetto, along with substantial 
statistical evidence that white therapists can have significant effects on lower class 
blacks and severely disturbed clients. 


SEX Sex differences in outcome have not usually been found in previous years, 
but some investigators feel that many former studies can and should be reexam- 
ined in terms of sex role dimensions. Heilbrun’s studies (133, 135) focused on 
undergraduate females and found that girls were more likely to defect from 
therapy with males to whom they had disclosed more. Riess (291) found that 
patients remain longer with female therapists and that female/female dyads 
continue treatment longer than male/male dyads, though the majority of patients 
prefer or request male therapists. 


Process Studies 


In his foreword to Kiesler’s (165) unique book summarizing and analyzing all of the 
important process analysis systems in psychotherapy research, Strupp asks: “. . . If 
the therapeutic force is somehow encoded in the messages, where shall we look for 
it?” Reading Kiesler’s treatise is certainly a good place to begin the search, since 
each system and a critique of the research done on it is included. Regrettably, 
many short-lived efforts abound in this field, and process analysts are still at- 
tempting to capture new “essential” ingredients of therapy. 

Matarazzo & Wiens (221) provide an excellent overview of one venerable 
program of research on speech interaction in dyads. Much of the data gathered 
since the early studies in 1954 focus on formal properties such as utterance 
durations, frequencies, latencies, interruptions, proportions of therapist-client 
talk, etc; but more recent work reveals some relationships between formal speech 
units and content variables. The latter is a valuable trend that needs extension into 
the meaningful content so prevalent in therapy interviews. Siegman & Pope (314) 
similarly summarize and update their long series of studies and the work of others 
on dyadic communication in interviews, and continue to add rigorous data (272) to 
this cumulative effort. 

The prominent series of studies on the classic Rogerian conditions has been 
expanded to include a wide spectrum of presumably facilitative dimensions such 
as immediacy, concreteness, intimacy, and confrontation. Carkhuff (56) summarizes 
some of this work and extends it into many areas of human need, focusing 
especially on the provocative idea that training clients in these therapist skills is a 
preferred mode of treating disorders and social deficiencies. 

In a variety of process studies, Mintz et al (246) found therapists, patients, and 
observers able to attain moderate agreement regarding “goodness” of sessions on 
the Orlinsky-Howard instrument; Griest, Klein & Van Cura (123) demonstrated 
the value of a computerized diagnostic interview that even psychotic and suicidal 
patients can respond to; Lauver et al (178) show that clients can directly modify 
counselor speech and silence behavior; Packwood & Parker (265) reveal new 
methods for measuring therapist reinforcement and persuasiveness; McCarron & 
Appel (230) and McCarron (229) show relationships between verbal and autonomic 
responses during therapy sessions; Lassen (177) shows the effect of physical 
distance (personal space) on client openness and facility of communication 
(better at 6 feet); Hekmat (137) shows that reflections are better reinforcers than 
interpretations in a quasitherapy interview; Luborsky et al (204) provide a meth- 
od for quantifying transference; and Luborsky (202) reports a careful study of 
momentary forgetting in psychotherapy. 


CHILD THERAPY 


Elaborate research programs on child psychotherapy are still not the norm, and 
one tends to find more data on behavioral treatments with children, perhaps 
because such studies are easier to do and the simplicity and manipulative nature of 
behavior therapies make them more readily adaptable to children. Davids (73), 
for example, documents improved effectiveness with psychotic children after 
switching from psychotherapy to behavior therapy. He also presents evidence for 
the effectiveness of paraprofessional college students as therapeutic models for 
underachieving adolescents. Miller, Barrett & Hampe (242), however, found no 
differences between psychotherapy and desensitization with nonpsychotic, phobic 
children, though both therapies were better than a control group. 

Rosenthal & Levine (296) showed that brief therapy can be as effective as longer 
therapy for selected cases but they do not define which ones. Wright, Truax & 
Mitchell (371) found that therapist facilitative conditions were difficult to rate 
reliably in play therapy due to too much nonverbal interaction, but Siegel (313) 
obtained high reliabilities under similar circumstances. Schopler et al (304) found 
autistic children to respond better to a structured than an unstructured therapy. 
McConville, Brian & Purohit (231) devised a new approach to classifying chil- 
dren’s disorders that may improve clarity and the ability to measure change. 
Jacob, Magnussen & Kemler (148) provide data on the long-term course of 
children’s disorders that have been minimally or maximally treated, and show the 
relevance of these observations to the spontaneous remission and deterioration 
controversies. 

Finally, Klein (167-169) has assembled comprehensive and very useful bibli- 
ographies on this subject matter totaling 1667 references through 1972, includ- 
ing international items and dissertations. 


ISSUES AND INNOVATIONS IN METHODOLOGY 


Perhaps the most important methodological trend is toward designing research that 
is close to the clinical phenomena while also providing objectivity and quantifi- 
cation not present in traditional case histories. This has been the thrust of the 
Menninger study which (a) produced the Health-Sickness Rating Scale, (b) applied
the paired comparisons method to clinical judgments, and (c) provided the
first utilization of multidimensional en analysis to psychotherapy (294, 
310). 


Perhaps the most widespread version of this trend is the development of
sophisticated single-case methodologies. Such approaches have the advantage of
providing results that are most readily applicable to practice and reveal more 
clearly the causes of changes. Many of these methods derive from behavior 
therapy experiments but they have wide relevance and applicability. Gottman & 
Leiblum (121), Gottman (120), and Barlow & Hersen (22) provide some of the 
most interesting and exciting possibilities for this type of research. Additional 
paradigms and examples are provided by Leitenberg (185), Shontz (311), and 
Bergin & Strupp (32). May (225) also presents mathematical models for monitor- 
ing patterns of change over time along with further suggestions for the design of 
outcome assessments (226, 227). Weissman et al (357) suggest ways of quantifying 
the dosage and quality of psychotherapy so that inputs across cases or techniques 
may be equated. 

Generally, the new suggestions for methodologies are most encouraging, and if 
followed, should yield doctoral dissertations and programmatic research studies of
much greater practical significance than heretofore. Some of the important trends 
in research and in where the likely payoffs may exist have been discussed in detail 
by Bergin & Strupp (32) and more than two dozen prominent researchers during 
an elaborate study of the status of the field and the feasibility of establishing large, 
collaborative studies. 


BROAD SPECTRUM TREATISES 


Prominent among the broad overviews of the literature and theoretical sugges- 
tions that have been published recently are the second edition of Frank’s (99) 
important interpretation of psychotherapy phenomena; Strupp’s (329) collection 
of his papers and summations on clinical, research, and theoretical issues; the 
Aldine Annuals on practice and research which presumably select the best articles 
for each year (216, 220); and the Bergin-Strupp (32) synthesis of the literature. 

Strupp continues to develop incisive theoretical interpretations of the basic 
ingredients in therapeutic change (328, 330); Holt & Peterfreund (146) have 
initiated an important new series relating psychoanalysis and contemporary 
science; and Mahrer & Pearson (210) have collected creative contributions to 
psychotherapy with extensive commentaries by prominent workers in the field. 

In general, this review has arrived at the surprising conclusion that evidence 
favoring behavior therapy (especially desensitization with complex cases) and the 
Rogerian conditions has declined while the empirical status of psychoanalytic 
therapy has improved. Differential effectiveness of techniques is not well estab- 
lished. Also, neither old nor new style groups have been shown to be superior to 
individual therapy. On the other hand, psychotherapies of all kinds are much 
more clearly established now as superior to untreated and placebo controls, 
though this difference in outcome appears to decline with time. This small batch of 
facts is of considerable import and is likely to stimulate new developments in 
theory and in technique experimentation. 


BEHAVIOR THERAPY


The field of behavior therapy has continued to mushroom since it was reviewed by 
Krasner (171) in the Annual Review of Psychology (1971). Some attempt will be 
made here at a representative sampling since the field has reached such promi- 
nence, but priorities will be given to some areas because of space limitations. 


Operant Conditioning 


The operant approach involves the systematic use of positive consequences to 
shape desired behaviors. As in the past, research on this topic has continued to be 
among the top two in terms of sheer quantity. An enormous amount of writing 
involves the application of operant conditioning to a variety of clinical problems. 
Among the problem behaviors examined have been anorexia nervosa (41), asser- 
tive or prosocial behaviors (132), compulsions (222), incontinence (166), hysterical 
blindness (152), mutism (262), obesity (326), phobias (141), psychotic behaviors 
(131), and self-help behaviors (96, 193). Some of the procedures applied to pho- 
bias are interesting inasmuch as they offer an alternative to desensitization. Barlow 
et al (21) used social approval to reinforce approach behavior to a snake, and 
discovered the procedure to be more effective in increasing such behavior but no 
different from desensitization in reducing GSR-measured anxiety. Shaping was
found to be facilitated by therapist presence or by modeling. Other studies provide
further insights into the use of reinforcement to deal with problem behaviors. 
Browning (52) discovered that a reinforcement program failed with autistic chil- 
dren because the learned behaviors did not generalize to new cues outside of the 
institution. Ravensborg (285) concluded that chronic schizophrenics increased 
social awareness behaviors only with the combined reinforcement of money and 
praise and not with praise alone. Lovaas, Litrownik & Mann (197) demonstrated 
that self-stimulation in autistic children is reinforcing. In a related sense, the 
question of what might constitute a reinforcer for autistic children was involved in 
a study by Steeves, Martin & Pear (323), in which they discovered that time-out 
itself might be a type of reinforcer for some cases. 

There appears to be increasing use of paraprofessionals to conduct operant 
programs in clinic settings (129). Kass, Silvers & Abrams (157) utilized patients 
themselves, who specified one another’s target behaviors and established rewards 
and penalties for behavioral change. Similarly, Nelson, Worell & Polsgrove (260) 
wrote contracts involving disturbed children as behavioral change agents for one 
another. These contingency contracts clearly specified the consequences of con- 
crete behaviors for both behavioral agent and target child paired together through 
the contract. Paraprofessionals are not always able to conduct programs well 
without receiving some payoffs themselves. Thus hospital aides showed substan- 
tial changes in their use of contingency work with patients when reinforcers were 
also distributed to the aides themselves (158). 

The use of token systems of reinforcement has become a technique in itself. 
Pomerleau, Bobrove & Harris (271) describe specific issues related to the
establishment of a token economy ward, with attention to costs, fines, and incomes.
Johnson, Katz & Gelfand (150) add an important aspect to the literature in describing a
rationale and training procedure for undergraduate behavior technicians. Some 
evidences of the success of token programs continue to be reported. Henderson & 
Scoles (139) report that patients from a token ward had a shorter average length of 
hospitalization. Hersen et al (144) recommend the use of reinforcement techniques
in hospital wards to prevent young psychiatric patients from slipping into
chronic hospitalization. Along with these continuing signs of successful interven- 
tion, more studies have appeared that attend to problems in implementing token 
programs. Allen & Magaro (4) criticize the indiscriminate establishment of token 
programs without attention to which types of patients would respond most favor- 
ably. They argue that some patients will emit the desired target responses even 
without reinforcement, some will not participate even for reinforcement, while 
others will participate only when tokens can be earned. They conclude that “the 
token economy has the most impact upon this last group” (4, p. 316). 

Hall & Baker (128) summarize several potential sources of failure in token econ- 
omy programs: the inability of the ward staff in some hospitals to deny acceptance 
of a patient with poor prognosis for success on a token system; the unwillingness of 
some token economy staff to use the system appropriately to shape behaviors; the 
low morale created if the token ward director fails to spend adequate time in 
supervising the ward’s progress; poor support from hospital administrators in 
preventing turnover of ward staff; and the interference of family members in 
influencing patients towards relapse. Kazdin (160) adds other possible reasons for 
failure of a patient to respond to token systems, such as the inadequacy of back-up 
reinforcers, the availability of other behaviors which the patient can show to 
obtain reinforcement from other sources, the absence of the target behavior in the 
repertoire of the patient, or the conflict between ward reinforcement and peer 
reinforcement. Atthowe (8) adds his concern for better planning based upon more 
individualized program design and less reliance on mechanical interventions. 
Winckler (365) offers specific insights deriving from economic men regarding 
the better control of reinforcers. 


Desensitization 


Desensitization involves breaking a stimulus-response association through the 
development of an incompatible response to the same stimulus. Typically the 
client is exposed systematically to anxiety hierarchy cues, with the incompatible 
response being mainly relaxation. Research on desensitization continues to appear 
at a high rate in the literature as it has in the past. One apparent new trend seems to 
be in the journals now accepting such studies. In the past, operant conditioning 
research seemed to be more attractive than desensitization studies to journals 
other than those which specialized in behavior therapy, but lately desensitization 
reports are appearing with greater frequency in nearly all clinical journals. 

As might be expected, desensitization continues to be applied toward the 
treatment of phobias or specific anxieties. Reports on the treatment of fear of 
snakes have received such attention as to precipitate minor controversy over the
heuristic value of the topic (23). A new contender for researchers’ interest seems to 
be test anxiety. Test anxiety is being investigated as much for being a clinical
problem needing resolution as for being a focus for examining the variables 
affecting desensitization outcomes (68, 81, 101, 250, 275). Other than such direct 
fears, a review of the literature does uncover some other applications of desensi- 
tization. Rimm et al (292) reduced inappropriate feelings of anger through the 
application of desensitization. Schnurer, Rubin & Roy (302) found increases in 
caloric intake in a case of anorexia nervosa treated through desensitization. 
Handwashing compulsions were successfully treated by Rackensperger & Fein- 
berg (281). Depression and fatigue seemed to lift as a result of desensitization for 
assertiveness by Schrauger & Katkin (305). In cases where homosexuality was 
linked to a fear of women, heterosexual behaviors were increased following 
desensitization therapy by LoPiccolo (195). Marzagao (219) analyzed a case of
kleptomania as involving the maintenance of such behavior through anxiety 
reduction; desensitization was substituted as a means of coping with anxiety, after 
which the stealing diminished. An anxious patient with psychotic auditory 
hallucinations was substantially helped in a study by Slade (315). A combination 
of desensitization and covert sensitization was a superior approach to reduce 
smoking in studies by Wagner & Bragg (350). 

The delivery of desensitization treatment through automated or semiautomated 
procedures seems to be useful. Baker, Cohen & Saunders (13) compared thera- 
pist-directed desensitization with tape recorded treatment and found equivalent 
effects. Cotler (67) ran four therapist present and four tape recorded treatment 
groups and discovered that approach behaviors to the feared animals and subject 
estimates of anxiety were similar for all treatment groups. Donner (81) reported on 
a 5 month follow-up of automated desensitization for test anxiety and found that 
treatment effects were not only maintained but further gains were made during the 
period without symptom substitution. Therapist presence did appear to have an 
effect in improvement of grades. Spinelli (320) found that therapist presence or the 
opportunity for a relationship had no effect on symptom reduction, but did affect 
client satisfaction with the service. A somewhat dissenting note appears in the 
study of Paul & Trimble (268) in which physiological measures were compared for 
live relaxation training versus taped training. Taped training was found inferior 
except on self-report measures. From these basically promising beginnings, 
further methods for reducing the role of the therapist have been attempted. Beck 
(28) and Mann (214) utilized video tapes for the treatment of test anxiety. Den- 
holtz (79) encouraged the use of audio tapes recorded during live treatment 
sessions for clients to take home for further use. Clark (64) and Phillips (270) re- 
lied upon self-administered desensitization, typically through the help of 
self-instructional manuals. Goldfried (112) instructed clients in how to use de- 
sensitization principles; and Russell & Sipich (299) added training in how to cue 
off relaxation. Lutker, Tasto & Jorgensen (206) had subjects memorize their 
individualized hierarchies from index cards to eliminate the necessity for the 
therapist to describe the scenes. 

Other variations or emphases on technique variables in desensitization have 
been published. Suinn, Edie & Spinelli (337) massed treatment in a 2 hour time
span, using only the highest anxiety hierarchy items and without use of anxiety 
signaling from the subjects. The success of this approach raised some questions 
regarding the necessity of hierarchies or repetition of lower scenes following 
signaling. Dua (83) examined massing effects using 12 hour, 5 day, and 15 day 
treatment periods and found that the two shorter term treatment periods were 
better than the 15 day procedure. The use of abbreviated hierarchies raises the 
question regarding the importance of such hierarchies. Mylar & Clement (258), 
dealing with public speaking anxiety, concluded that there was a low correlation 
between the number of hierarchy items completed and recovery. Further examination
of the entire issue of anxiety hierarchies, the length of treatment, and
the requirement of signaling seems desirable. 

As desensitization therapy has become accepted as a mode of treatment, re- 
search has moved away from proving its worth and more towards methodological 
issues. One primary methodological criticism of previous research has been that 
related to expectancy, demand characteristics, or set. Simply, the concern is that 
desensitization has been shown to work, but only because clients have been led to
expect it would work. Thus it has been the suggestibility of the clients rather than 
the elements of counterconditioning which is responsible for the favorable out- 
comes. Lomont & Brock (194) discovered that all desensitization groups showed 
equivalent reduction in snake phobia regardless of whether they expected a 
favorable outcome or were given a rationale about the treatment. McGlynn (236) 
gave one group therapeutically favorable expectations, a second therapeutically 
unfavorable expectations, and a third instructions suggesting that the study did not 
involve therapy at all, and compared results against pseudotherapy and no- 
treatment groups. All desensitization groups showed behavioral improvements 
better than the two control groups. Moreover, the three treatment groups did not 
differ from one another, suggesting that level or nature of expectations are not 
influential variables. A study with conflicting findings was completed by Miller 
(244) using (a) a therapeutically favorable desensitization group; (b) a desensiti- 
zation group with no instructional set; (c) a group with the set that the study 
involved imagery research rather than therapy; (d) a pseudotherapy group with 
favorable therapeutic instructions; and (e) a no-treatment group. Groups a and b 
showed greater behavioral changes than groups c, d, and e. A measure of 
awareness regarding the therapeutic nature of the research was found related to 
outcome. Miller concludes that therapeutically favorable instructions do not en- 
hance desensitization efficacy, although misleading subjects so that they are un- 
aware of the therapeutic intent of the procedures seems to reduce the effects of 
desensitization. Persley & Leventhal (269) developed groups differing on two 
factors: therapeutically favorable versus neutral set, and imagery paired or not 
paired with relaxation. Behavior approaches to a rat improved for all groups 
except the one involving a neutral set combined with imagery without relaxation. 
The therapeutically favorable instructions did not seem to substantially alter the 
level of fear reduction; this is consistent with the Miller results. On the other hand, 
imagery without relaxation was found effective when the subjects also had a
therapeutically favorable set, suggesting that expectancy plays some role in be- 
havioral change. 

Another major issue raised in the literature involves the role of relaxation. One 
series of studies was concerned with whether relaxation was needed for
desensitization; another series aimed at the effects of relaxation alone and without anxiety
hierarchy imagery. Crowder & Thornton (70) had subjects imagine scenes without 
relaxation and discovered that this group made gains on a behavioral avoidance
task equivalent to that of a standard desensitization group. Waters, McDonald & 
Koresko (353) also found no differences across groups in behavioral avoidance 
measures, but the standard desensitization group showed better improvement on 
subjective fear and some physiological measures. They concluded that relaxation 
played a limited role in desensitization. Agras et al (3) used subjects as their own 
control by doing a reversal type study. With a very small sample size, they found 
that therapeutic gains were generally maintained when relaxation was removed. 
Aponte & Aponte (6) attacked the issue in a slightly different fashion by rear- 
ranging the time of appearance of the relaxation responses. The standard desen- 
sitization group received the relaxation training prior to scene visualization, while 
the experimental group was provided with the anxiety imagery first and then 
trained in relaxation. Both groups fared equally well on test anxiety measures 
following treatment. Other researchers have worked with relaxation alone. Al- 
though earlier studies suggest that relaxation alone is less effective than desen- 
sitization, two recent ones concluded otherwise. Freeling & Shemberg (101) 
examined the relative contributions of relaxation alone, imagery alone, and re- 
laxation plus imagery (standard desensitization). They found relaxation alone led 
to equal reductions on subjective measures of test anxiety compared to relaxation 
plus desensitization, with these two methods being superior to imagery alone. On 
the other hand, imagery was found to lead to substantial improvements on a 
behavioral task as compared to the other methods. Laxer & Walker (180) also 
studied test anxiety and concluded that relaxation alone was a successful
approach.

A confounding variable which confuses attempts to clarify methodological 
problems such as that just reviewed has to do with criterion measures. Some of the 
inconsistencies across studies may well be related to differences in measures. 
Among the common means used to assess fear reduction are changes in self- 
estimations of subjective anxiety, physiological measures, or behavioral avoid- 
ance tasks. It is becoming clearer that these measures do not correlate well with 
one another. Ross & Proctor (298) reported a .41 correlation between subjective 
anxiety measures and performance on a behavior avoidance task. Mylar & Cle- 
ment (258) found discrepant results regarding the effects of implosive therapy 
when subjective measures were compared with a behavioral measure. Wilson &
Thomas (363) noted that favorable therapeutic instructions affected fear reduction 
as measured by subjective reports of anxiety, but not when measured by behavior 
avoidance tasks. Davidson & Hiebert (74) discovered a stressor film presented 
under varied conditions to have an effect on subjective anxiety but that the results 
differed with physiological measures. Riddick & Meyer (290) were able to use
therapeutic instructions with a placebo pill to make it effective in reducing sub- 
jective anxiety but not physiological responses. Thus one need which faces future 
research is the need for standardization of criterion measures, and better under- 
standing of the meaning of differing measures. 


Aversive Conditioning 


Aversive conditioning techniques typically rely upon aversive consequents as a 
means of modifying behaviors. Aversive procedures may involve escape learning, 
active avoidance learning, passive avoidance learning, or response contingent 
aversive stimulation (punishment) (198). A review of the literature quickly 
confirms that aversive treatment seems reserved for those behaviors considered to 
be deviant because of cultural factors or excesses. Thus, common target behaviors 
include excessive eating, habitual smoking, heavy consumption of alcohol, or 
sexually deviant activities. Indeed, Davison & Wilson (76) discovered in a survey 
that behavior therapists overwhelmingly select aversive therapy for homosexual 
problems. Over the period of this review, studies were found reporting on the 
treatment of eating problems (358), exhibitionism (325) and other sexual devia- 
tions (1), hallucinations (356), seizures (370), stuttering (72), and voyeurism (110). 
Some of the cases of deviancy were removed by a shame aversion technique. 
Serber (308) required the patients to perform their deviant behaviors in the 
presence of observers, emphasizing the embarrassing consequences of such reve- 
lation. Stevenson & Jones (325) increased the aversive experience through having 
staff directly discuss the patient and his deviancy. Self-administered shock was 
useful in reducing hallucinatory behaviors among schizophrenics to a greater 
degree than for a placebo group who had a shock box but received no shock (356), 
or a no-treatment group. However, hallucinations decreased for all groups, raising 
methodological questions about how the placebo influences the research. 

A case report on self-induced seizures is one of the few that resulted in symptom 
substitution of sorts. Wright (370) eliminated self-induced seizures triggered by 
hand cues, but the mentally retarded client switched to eyeblinks to stimulate 
seizures. Another case of individual failure was reported by Kenny, Solyom & 
Solyom (161). Of five patients with obsessional thoughts, four showed reductions 
following aversive treatment. However, the remaining patient developed psy- 
chotic delusions. Some informative results were obtained by Daly & Frick (72) 
in understanding possible failures with stutterers. They discovered that shock
administered when a stutterer expected to stutter failed to reduce stuttering; 
however, shock presented when stuttering occurred did lead to reduction. It may 
be that covert responses such as expectations are not necessarily anticipatory cues 
for stuttering. 

A study of stridor emissions has added information to further evaluate
variables involved in aversion control. Lebow, Gelfand & Dobson (184) experi- 
mented with a patient who emitted high pitched sounds by placing him on 
different schedules of aversive conditioning. During one experimental phase the 
patient was able to escape shock if he emitted fewer than 10 sounds within 10
seconds; in a second phase the patient escaped if fewer than 3 sounds were emitted
in 10 seconds; in a third phase the patient was shocked after any emission. They 
discovered that a regular reinforcement schedule was inferior in controlling the 
behaviors. In addition, they noticed that the transition from one schedule to 
another tended to promote relapses in learning. Rosenthal (297) discovered that 
the application of aversive therapy based upon a traditional punishment paradigm 
failed with a pedophiliac. However, using aversive consequences based on a 
shaping paradigm was successful. The examination of data associated with prog- 
nosis is slowly reaching the literature. Evans (88) compared ten failures with ten 
successes in the aversive treatment of exhibitionism. Failures appeared related to 
frequency of masturbation, deviant fantasy, and length and frequency of exhibi- 
tionism prior to treatment; failure rate was not associated with marital status, age, 
educational level, frequency of masturbation or intercourse. MacCulloch, Birtles 
& Feldman (207) suggest that “weak-willed” and attention-seeking homosexuals 
are poor risks for anticipatory avoidance aversion therapy, while those with 
satisfactory prior heterosexual relations are good risks. 

The treatment of excessive alcoholic consumption or smoking has received 
much attention. Based upon the relationship between blood alcohol concentration 
and drunken behavior, Lovibond & Caddy (199) trained patients to discriminate 
levels of concentration. Patients were shocked if their alcohol level went beyond 
.065%. Of 31 patients, 21 were able to learn to drink at the moderate level. 
Follow-up data appeared to show retention of the moderation. Sobell & Sobell 
(319) describe an intensive program for training moderation involving shock, 
identification of alternative behaviors, stimulus control, and video tape feedback. 
Patients treated through this program did substantially better than patients treated
by traditional hospital treatment at discharge and 6 month follow-up. An impor- 
tant aspect of the research was the application of the program to investigate effects 
on two different target behaviors: drinking in moderation versus complete ab- 
stinence. Follow-up results indicate that patients trained in moderate drinking 
were as free from drunkenness as the patients trained in abstinence. Moreover, the 
moderate drinkers were able to participate in alcohol consumption for a larger 
percentage of days without excess than the patients trained in abstinence. These 
studies are encouraging in light of the previously held assumption that alcoholic 
excesses can only be dealt with by abstinence and not by controlled drinking. 

The control of smoking has been notoriously resistant to change. Research has 
been frustrating when follow-up data are considered because of the relapse rate. 
Chapman, Smith & Layden (61) included therapist monitoring at follow-up
periods as part of their program of shock and self-management training. They 
discovered that the subjects with 2 weeks of posttreatment follow-up meetings had 
a 75% relapse rate, while subjects with 11 weeks of follow-up had a 45% relapse 
rate. Schmahl, Lichtenstein & Harris (303) confirmed that different intervals of 
follow-up led to differential outcomes in abstinence. Their program involved 
aversive training with smoky air as well as high favorable expectancy instructions 
and social reinforcement. The expectancy factor may have been a component in 
the finding that a control group also showed gains. Some self-control approaches
have been attempted, emphasizing covert variables. Berecz (30) used self- 
administered shock with one group, using the shock following imagined smoking, 
while another administered the shock following actual smoking. The results sug- 
gested that the covert imagery procedure was more effective for the heavy smoker, 
while both procedures were of value for the moderate smoker. Sex differences 
appeared to be present, with the results being inconclusive for female subjects. 
Such studies start to focus attention on covert stimuli as part of the prompts for 
behavioral excesses. Along this line, one final study might be mentioned briefly. 
Gaupp, Stern & Ratliff (110) directed the treatment of voyeurism towards words 
associated with the deviant behavior. Shock was administered to slides with 
voyeur-related words and aversion relief to slides associated with acceptable 
sexual behaviors. The success of this approach expands the selection of target 
behaviors to include semantic responses and is consistent with language behavior 
therapy (321) and semantic desensitization (136, 138). 


Anxiety Induction Methods 


FLOODING³ Flooding eliminates fears through forcing the patient to continuously
experience anxiety in imagery or in reality. Although originally aimed
at fear reduction, some recent applications have been made of flooding to 
obsessive-compulsive behaviors. For example, Baum & Poser (26) used the tech- 
nique with a neurotic and a psychotic with success. Hodgson, Rachman & Marks 
(145) compared a flooding approach to others and found flooding plus modeling 
in vivo to be better than flooding alone, modeling alone, or relaxation alone with 
five obsessive-compulsives. These authors (280) added a unique concept to the 
literature through an appendix to their article describing patients within each 
group and reporting on treatment failures. 

Some variations in the flooding method have been tried to improve on its usage. 
Dawley & Wenrich (77) designed high anxiety scenes to provide flooding to test 
anxious subjects in a group setting. Prochaska (275) selected the scenes on the 
basis of situation specific cues or psychodynamic cues. In the latter group, scenes 
were composed based upon the assumed dynamics of test anxiety. A third group 
was faced with scenes associated with general anxiety settings not related to test 
anxiety. Following tape recorded treatment, it was found that the use of flooding 
with either situational or psychodynamic cues was more effective in reducing test 
anxiety than the general anxiety approach, a placebo group, or a no-treatment 
group. However, the general anxiety group showed significant decreases in general 
anxiety as measured by a fear inventory. Stadter (322) examined the recommendation
that encouraging clients to emit tense bodily movements during flooding
exposure would enhance the outcome. Results on rat phobics did not support the
hypothesis; rather it was discovered that bodily movements plus using a toy rat to
mimic scenes seemed helpful even without flooding arousal.


³ A variety of terms have been used to refer to high intensity anxiety stimulation. Flooding
tends to mean the evoking of high anxiety through real or imaginal stimuli, while implosion
generally has meant the arousal through fantasy. In agreement with Marks (215), the term
flooding will be used to refer generically to both techniques.

Further investigation of the variables influencing the outcome of flooding has
revealed some interesting results. Stern & Marks (324) discovered that two hours
of flooding reduced agoraphobia substantially more than four half-hour sessions. 
Heart rate and subjective anxiety decreased more over the second than the first 
hour of prolonged exposure. In a small sample of case studies, Boulougouris & 
Bassiakos (48) continue the discussion of prolonged implosion techniques as 
applied to obsessive-compulsive behaviors. Fantasy sessions lasted up to 50 min, 
and combined fantasy-practice sessions were run for up to 3 hours. Fazio (92) 
attempted to evaluate the reality-testing and supportive components of flooding 
separate from the anxiety induction aspect. One of the few double-blind experi- 
ments was carried through, with findings indicating the reality testing-supportive 
oriented treatment group subjects showed greater reductions in fear than the 
flooding subjects. Watson & Marks (355) compared flooding of fears expressed by 
the subjects (relevant) with flooding of fears irrelevant (because of being unlikely 
to occur, e.g. being eaten by a tiger). Both approaches led to similar reductions in
the subjects’ fears. However, the authors examined the data for information 
possibly associated with prognosis. They discovered that subjects showing high 
physiological reactivity at the start of treatment appeared to profit from flooding 
of relevant but did not profit from flooding of irrelevant scenes. Subjects with high
subjective anxiety prior to therapy experienced poor outcomes with flooding of
irrelevant scenes. On the other hand, subjects who reported high subjective anxiety
during treatment sessions seemed to respond favorably to flooding of irrelevant 
scenes. A final study should be mentioned regarding the influence of expectancy 
on outcome. Borkovec (42) manipulated favorable therapy versus neutral 
nontherapy and false physiological feedback in snake phobic subjects. Results 
indicated that the subjects with positive expectations of improvement and false 
feedback showed significant reductions in avoidance measures, while the neutral 
plus false feedback subjects showed no effects. 

The comparative effects of flooding and desensitization have received much 
attention because of the similarity of target behaviors treated. Mylar & Clement 
(258) found flooding to be equal to desensitization in the treatment of public 
speaking anxiety. DeMoor (78) reported the two approaches to be equivalent with 
snake phobia, although there was a nonsignificant trend for desensitization to be 
more stable at follow-up. Mealiea & Nawas (238) determined desensitization to be 
better than flooding, with flooding being no better than pseudotreatment on snake 
phobics. At follow-up desensitization continued to be the best, with flooding the 
worst. Hekmat (136) indicated that desensitization was better than pseudotherapy 
in the treatment of fear of rats. Flooding was not found to be effective. On test 
anxiety, Cornish & Dilley (65) found desensitization to be better when the crite- 
rion was a measure of debilitating anxiety. However, both approaches were 
equivalent on a physiological measure. Using grades, Smith & Nye (318) 
reported desensitization to be better in effecting change than flooding. Finally,
Boulougouris, Marks & Marset (49) concluded that implosion was better for
agoraphobia, while desensitization had better results with “specific phobias.”


ANXIETY MANAGEMENT METHODS Methods such as anxiety management training 
(AMT) and induced anxiety (IA) encourage clients to focus attention on anxiety 
arousal. However, unlike the implosion technique, the anxiety management ap- 
proaches then train the clients in anxiety control. Anxiety management training is 
based upon the idea that anxiety can be viewed as a response as well as a stimulus. 
By increasing the stimulus properties and then associating the stimuli to responses 
incompatible to anxiety, the client learns to reduce the anxiety. Although AMT 
was originally designed to train coping behaviors in clients with “free-floating 
anxiety,” early published studies were on specific anxieties. Suinn & Richardson
(339) compared AMT-treated mathematics-anxious subjects with desensitization 
subjects and a no-problem control group. The AMT and desensitization groups 
both showed significant reductions in subjective anxiety, with anxiety levels 
reaching that of a no-problem group. Improvements in behavioral measures were 
also observed. Edie (86) and Nicoletti (261) studied the effects of AMT on free- 
floating anxiety and public speaking anxiety and discovered AMT to be beneficial 
when compared with no-treatment groups and no-problem control groups. Clini- 
cal observations suggested that some clients who were originally unable to specify 
the stimuli prompting anxiety responses were later able to identify such cues 
during AMT treatment. Goldfried (113) also used a method combining anxiety 
arousal followed by the use of relaxation as a self-control technique. Unlike AMT, 
Goldfried’s method is a variant of desensitization and retains some of the desen- 
sitization procedures and concepts. His technique continues to rely upon the use of 
hierarchical scenes to prompt anxiety, although the scenes may be reduced to a 
single multithematic one. Whereas the AMT method was derived from a specific 
theoretical viewpoint and aimed at free-floating anxiety, Goldfried’s variation of 
desensitization appears to have been developed for clinical rather than theoretical 
reasons. The variation overlaps enough with AMT as to be classifiable as an 
anxiety arousal method. 

Whereas AMT stimulates anxiety arousal through directing clients to use
imagery and recall of previous anxiety experiences, the induced anxiety method uses
suggestion to bring forth unspecified feelings which are later shaped into affective 
reactions. The IA procedure also reinforces overt responses, utilizes inquiries 
about thoughts and feelings linked to affective states, and follows up with
interpretation and discussion to promote understanding and integration. Korn,
Ascough & Kleemeier (170) tested the hypothesis that IA actually aroused anxiety
among subjects. Results confirmed that IA led to change in state measures while not
affecting trait measures. Burns & Ascough (55) basically confirmed the ability of 
IA to increase anxiety as measured by physiological instruments and when com- 
pared to relaxation suggestions. Bornstein & Sipprelle (47) applied IA to weight 
reduction. An IA group was compared with a relaxation-only group, a group 
receiving verbal reinforcement, and a wait list group. The IA group showed the
maximum weight loss following treatment and on through a 3 and a 6 month 
follow-up. 


Modeling 


The presentation of a model is based upon the premise that behaviors may be 
altered through imitative learning. A variety of behaviors have been approached 
through modeling techniques, including moral judgments among delinquents 
(274, 301) and independence in neurotic and psychotic patients (117). A favored 
use of modeling is in the treatment of assertive behavior deficits. Unassertive 
psychiatric patients were trained to respond to five interpersonal conditions via 
models by Eisler, Hersen & Miller (87). A control group received practice in the 
five situations but no models to view. The model-trained patients showed signifi- 
cant improvement on five of eight components of assertiveness as compared to the 
control group and a no-treatment group. McFall & Twentyman (235) conducted a 
series of four experiments involving as many as 90 patients to determine the 
relative contributions of modeling, rehearsal, and coaching to assertive training. 
Behavior rehearsal seemed to be the most important variable in effecting change, 
with modeling adding little when combined with rehearsal. This held regardless of 
the media used: video versus audio only. Finally, overt and covert rehearsal 
appeared to be equally effective. Young, Rimm & Kennedy (373) examined the 
relative contribution of verbal reinforcement to the acquisition of assertive be- 
havior. Using undergraduate women, they discovered modeling alone to signifi- 
cantly increase assertiveness as compared with pseudotherapy or no treatment. 
Reinforcement added to modeling did not appear to make a substantial contri- 
bution. Rathus (284) used video tapes with models discussing assertive behaviors 
and then displaying them. Subjects showed increase in assertive behaviors, but 
their social fears remained unchanged. 

Studies have been aimed at determining the specific variables affecting the 
outcomes of modeling attempts. The characteristics of the model appear to have 
some effect. Doster & McAllister found that a high status model affected under- 
graduate subjects more than a peer model (82). Kazdin (159) attempted to reduce 
fears of snakes through the presenting of either a model who was “coping” 
(anxious in approach but eventually successful) or “mastering” (confident and at 
ease in successfully approaching the snake). Control groups included a no-model 
group and a no-treatment group. The modeling groups showed greater reductions 
in emotional arousal and anxiety ratings than the control groups. The coping 
model was more influential than the mastery model, suggesting the impact of 
model-to-client similarity. On the other hand, Bandura & Barab (17) discovered 
that subjects improved regardless of similarity to the model. They concluded that 
fear reduction for adults and children occurred for differing reasons. The mech- 
anism associated with reduction via adult modeling was considered to be that of 
vicarious extinction, while the reduction via child modeling was viewed as in- 
volving motivational inducements. Jaffe & Carlson (149) had models who were 
calm receiving positive feedback, or anxious with positive feedback, or calm with
negative feedback, or anxious with negative feedback. Test anxious subjects exposed
to anxious models showed most improvement in grades, while viewing
models with negative consequences seemed to lead to reductions in subjective 
anxiety. The authors question the validity of other researchers’ reliance upon 
models in competent roles in treatment studies. Of final interest is the possible 
effect of sex differences. O’Sullivan, Gilner & Krinski (264) used male and female
models to reduce snake phobia. Male subjects seemed to respond equally well to
either sex models, doing better than males without modeling treatment. On the
other hand, women students failed to show any gains throughout the study. 

Rachman (278) concludes that modeling is facilitated by a combination of audio
and visual presentations, repeated practice, prolonged exposure times, the use of 
multiple models, the use of multiple fear stimuli, the use of participant modeling, 
and the use of relaxation training. He summarizes Bandura’s recommendations as 
involving the use of clear pretherapy instructions, repeated modeling, a progres- 
sive increase in the difficulty of modeled approaches, the use of a model similar to 
the observer, and the description by the model of progress. Bandura’s comments 
regarding participatory modeling include the need for repeated practice, provision 
of guidance during practice, encouragement, the graduation of practice exercises, 
the use of feedback, the provision of favorable conditions during practice, and 
regular reinforcement. 

Contact desensitization may be viewed as a form of participatory modeling. By 
this procedure, the therapist models the adaptive response, using graduated ex- 
posure to the feared object and physical assistance to aid the client in performing 
fear-eliciting behavior. A few studies deserve mention as attempts to identify the 
influence of specific variables on outcome. Lick & Bootzin (192) compared contact 
desensitization with a placebo group with a high expectation of success, a role- 
playing group asked to simulate change, and a no-treatment group. It was dis- 
covered that contact desensitization was clearly superior to the other groups, 
suggesting that expectancy or demand characteristics play minimal roles in this 
therapy procedure. Murphy & Bootzin (257) compared the use of active versus 
passive participation in contact desensitization. In an active mode, the child was 
encouraged to approach the feared snake; in the passive mode, the child had the 
snake brought nearer by the therapist while the child remained seated. Although 
stimuli were more fear eliciting in a passive mode, both procedures led to equiv- 
alent levels of fear reduction. 


Other Techniques 


COVERT METHODS Covert reinforcement (COR) is based upon the assumption 
that mediational thought cues can serve to reinforce desired behaviors (58). Added 
to COR have been the related methods of covert negative reinforcement (59) and
covert extinction (60). All of these methods aim at control over the presentation or 
removal of reinforcers through controlled fantasy. In covert sensitization, an 
undesirable response is eliminated through pairing fantasy of the action with 
aversive fantasy. Wisocki (367) used covert sensitization, covert reinforcement, 
and thought stopping to treat a heroin addict. Curtis & Presly (71) eliminated 
homosexual activities through covert sensitization, showing a reduction from a
baseline of one homosexual encounter per week to none. Barlow et al (20) found 
covert sensitization successful in counteracting homosexual arousal to photo- 
graphs despite instructions to achieve arousal. Foreyt & Hagen (98) demonstrated 
that overweight subjects reported decreased liking for favored foods following 
either covert sensitization or placebo treatment. Thus the influence of expectancy 
or demand characteristics is raised. 

Coverant control methods are basically covert although the target behaviors for 
change are thoughts, fantasies, or imaginations themselves. COR and covert 
sensitization, on the other hand, deal with imaginations primarily as a means to 
alter overt behaviors. Johnson (151) and Todd (344) both used index cards with 
statements read by their clients, followed by a high frequency response as a 
reinforcer. Depressive states were reduced in both reports. 

Self-management training is becoming a significant trend. The stress is on 
helping patients to help themselves. Self-control techniques are reviewed by
Mahoney (208) and Meichenbaum (239). Self-monitoring is one such technique. 
McFall (233) discovered that recording resistances of the urge to smoke led to 
decreases in smoking, while self-monitoring of the actual frequency of smoking 
led to increases. Mahoney et al (209) found that continuous self-monitoring
tended to maintain desired behaviors longer than intermittent self-monitoring.
Gordon & Sachs concluded that “... having a S keep a daily record of his own
smoking behavior is the most active ingredient of change... .” (italics added) (119,
p. 12). Kanfer (153) suggests that self-monitoring should be viewed as involving
self-reinforcement. A different perspective is provided by Logan (193), who views 
self-control itself as a drive or learned response which might be shaped. 


ASSERTIVE TRAINING Some of the literature has been covered in the prior section 
on modeling. Lazarus (183) criticizes researchers for overtraining in negative, 
aggressive responses in assertive training, while overlooking positive aspects. He 
also adds a conceptualization of assertion as involving the ability to say “no,” the 
ability to ask for favors, the ability to express positive and negative feelings, and 
the ability to initiate or terminate conversations. Rathus (284) classifies assertive 
behaviors into nine categories, including: assertive talk, feeling talk, active dis- 
agreement, agreeing with compliments, and avoidance of justifying of opinions. 
Wolpe (368) through his “supervision transcripts” elaborates on training novice 
therapists in learning to give up control as patients learn greater assertiveness. 


BEHAVIOR REHEARSAL Behavior rehearsal employs the concept that emitting a 
response is a necessary step in its acquisition. When used therapeutically the 
rehearsal is typically achieved under controlled conditions. Hedquist & Weinhold 
(132) found a behavior rehearsal group to emit more assertive verbal responses 
than a placebo control group. McFall & Marston (234) found a nonsignificant 
tendency for behavior rehearsal with feedback to be better than without feedback, 
using an unobtrusive assertive test. Suinn (335) used visuomotor behavior 
rehearsal (rehearsal with controlled imagery) to help a doctoral student pass his 
examinations. Results involved behavioral change without reduction of anxiety.
Suinn also employed visuomotor behavior rehearsal to enhance the performance 
of ski racers. 


FEEDBACK This technique involves providing information to the subject re- 
garding performance. Typically the feedback data are in the form of physiological 
information. Considerable controversy has arisen regarding the ability of false 
heart rate feedback to reduce fears. Borkovec (46) points to the variables of 
expectancy, level of fear, belief in the data, and pretesting demands as all affecting 
false feedback. In another light, Borkovec (43) also raises the possibility that 
subjects who perceive their autonomic cues accurately should be equally respon- 
sive to either desensitization or flooding therapies for fears. Inaccurate perceivers 
may be better candidates for flooding, while low autonomic reactors might be 
better for cognitive restructuring. Feedback has also been used as a direct training 
method for relaxation. Some form of biofeedback was used to treat pain or anxiety 
symptoms by Budzynski, Stoyva & Adler (54) and Raskin, Johnson & Rondestvedt
(283). 


MISCELLANEOUS A variety of interesting miscellaneous techniques have been 
reported. Marquis (218) changed sexual object choice through masturbatory fan- 
tasies. Moan & Heath (251) initiated heterosexual behavior in a homosexual 
through direct electrical septal stimulation. A fading technique interchanging 
male and female slides was used for treating homosexuality by Barlow & Agras 
(19) and McCrady (232). Suinn (338) used excitatory scenes to counter highway 
hypnosis, while tension headaches (342) and an LSD freakout (336) were con- 
trolled by relaxation training alone. 


ASSESSMENT METHODS A number of approaches to facilitate behavioral assess- 
ment have been developed. These include: assertiveness scales (25, 103, 104, 284); 
homosexual arousal testing equipment (14); a sexual orientation measure (95); a 
school behavior check list (243); and imagery measures (286). A variety of in- 
struments are also compared and discussed in the literature (34, 45, 124, 142, 306). 
Finally, several studies have reported on the Fear Survey Schedule (91, 179) or 
offered a factored version (24, 175, 240, 341, 343). 


Reviews and Books 


A number of valuable reviews have been published with conclusions relevant to 
either theory, practice, or research. Goldfried & Kent (114) discuss the predictive 
ability of behavioral versus traditional assessment. Davison & Wilson (75) report 
on research as it relates to desensitization theory and practice and also review fear 
reduction studies on animals as an experimental basis for desensitization (362). 
Wilkins (359) and Borkovec (44) both discuss the social and cognitive variables 
potentially involved in desensitization. Ayer (10), Morganstern (252), and Marks 
(215) cite the literature on flooding or implosion. Blanchard & Young (27) survey 
the effects of cardiac self-control studies. Rachman (278) discusses the applications
of modeling. Carlson, Hersen & Eisler (57) examine trends in token economy
programs. Berkowitz & Graziano (33) review the role of parents as behavioral 
change agents. Mahoney (208) offers recommendations regarding further research 
in self-control methods. Hersen, Eisler & Miller (143) summarize clinical work on 
assertive responses. Abramson (2) and Stunkard (332) discuss differing ap- 
proaches to weight control. Hartmann (130) raises several issues within behavior 
modification methods with children. McNamara & MacDonough (237) comment 
on methodological problems facing general behavioral research. Atthowe (7) 
surveys a variety of contributions of behavioral techniques for intervention. Fi- 
nally, Lanson & Benassi (176) offer some data on the availability of behavior 
modification training. 

As the field of behavior therapy has expanded, so has the availability of books 
on the topic. This period has seen the publication of some truly important volumes 
aimed at scholarly overviews, such as the works of Kanfer & Phillips (154) and 
Yates (372), which fit in nicely with the previously published books of Bandura 
(15) and Franks (93). Some materials emphasizing clinical technique are also now 
available including the slightly expanded second edition of Wolpe’s (369) book, 
Lazarus’ writing on broad spectrum therapy (181), Watson & Tharp’s unique 
self-modification book (354), Stuart & Davis’ on control of obesity (327), Gelfand 
& Hartmann’s on child behavior modification (111), and Liberman’s (190) and
Goldfried & Sprafkin’s (116) on behavioral analysis and therapy. Topics of special 
interest include the writings of Bandura (16) on aggression, Rickard (287) on 
programmatic approaches, Gardner (107) on mental retardation, O’Leary & 
O’Leary (263) on classroom management, Feldman & MacCulloch (95) on
homosexuality, Shapiro et al (309) on biofeedback, and Goldfried & Merbaum (115)
on self control. In addition to the directory of the Association for the Advancement
of Behavior Therapy, there is also available a directory of behavioral therapists in 
the United States and Canada (254). 


Critique and the Future 


During one early phase of preparation of this review, over a thousand files on
published research had been formed. Such an impressive amount of literature 
must surely reflect major investments of research efforts by vast numbers of 
behaviorists. As we conclude this review, we are struck by both positive and 
negative feelings. On the favorable side, it is gratifying from a clinical viewpoint to 
see the wide usage of behavior therapy techniques and the apparent continuing 
success of such approaches. Somehow behavior therapy continues to have a 
positive impact on clients even amidst the theoretical ambiguities that still sur- 
round therapeutic methods and the controversies which continue to arise. On the 
unfavorable side, it is highly unsatisfying from an experimental viewpoint to note 
that the field still suffers from its share of poor research designs and inconclusive 
results. From a clinical sense, we seem to know how to intervene successfully; but 
from an experimental sense, we may not be able to specify precisely or conclu- 
sively the crucial variables involved in such intervention. Our salvation, of course, 
is that such a dilemma may be simply inherent in the field of therapy, whether it
be behavior therapy or psychotherapy, for both suffer equally. 

Several important methodological issues have become evident in the behavioral 
research. First, clarification is needed about the discrepancies observed among 
differing measures. For example, behavior avoidance measures of fears do not 
always correlate with subjective estimates or physiological data (290, 298). One 
explanation is that each measure reflects different components of fear responding. 
Thus an improved behavioral approach score but an unchanged subjective anxiety
score would mean that the client had learned new overt responses but had not yet
altered affective responses or covert attitudinal responses. Overt behavior change 
may precede or follow, but it is independent of affective, attitudinal, or cognitive 
change. The case of the client mentioned earlier who learned to perform adap- 
tively in spite of anxiety (335) is an example. Another explanation is that discrep- 
ancies reflect methodological flaws in research design or in measurement. A 
subject might successfully undertake an anxiety-eliciting act while remaining 
fearful because of the demand characteristics of the experiment (269). Given the 
lack of clarity and consistency in outcome measures, it is difficult to interpret
outcome studies on a treatment technique where different criterion measures are
used. A second methodological issue is the great need for more research relying
upon standardized measures. Even the Fear Survey Schedule now has so many 
forms that it is difficult to know whether there is comparability of data across 
studies using this instrument. A third issue is the fact that several extraneous 
variables appear to influence outcomes and thus demand better control. Client 
expectations seem to be important under some conditions with some measures and 
for some treatments. More detailed studies are needed to specify such interactions 
more precisely. In passing it might be stated that methods for manipulating 
expectancies differ across research. “High expectancy” sometimes refers to in- 
structions that the procedure involves simply a form of therapy, while at other 
times it refers to instructions that the procedure has been found to lead to symp- 
tom reduction. A fourth problem concerns the effects of modeling. To what extent 
are token economy or other group setting studies effective because patients ob- 
serve others? 

With the growing data on attentional variables, a fifth serious question can be 
raised regarding the effects of self-report pretesting, base-line data gathering, or 
even the singling out of a chronic patient to receive a new treatment. Perhaps all of 
these promote change solely because of the observer-attention or self-attending 
features. Regarding self-monitoring to obtain base lines, Gordon & Sachs accuse 
“... researchers unaware of the reactive nature of self-monitoring of being guilty
of failing to set a true base-line prior to specific treatment” (119, p. 14). Self-report 
measures may be influenced by the reinforcing value of self-monitoring (233, 244) 
or the observer reliability problem. Walter & Gilmore (351) found that even 
though a placebo treatment group of parents accurately described no change in 
their children, they still answered “Yes” to the question, “Has your child im- 
proved?” Therapists are often also observer-raters in their own research; as such, 
they too may be faulty observers because of bias and dissonance (147). The base-line
phase of research suffers potentially from the reinforcing value of attention,
but may also be a problem in that there may be alterations of behaviors due to 
change itself regardless of the content of the change (184). Chronic patients who
have been left as hopeless may change when suddenly faced with an “enriched”
therapeutic environment provided by a therapist with a new technique to try.
Another confounding variable occurs from the fact that such chronic patients are 
often the target of a variety of treatments in their histories, and it is difficult to 
determine whether change is related to the new treatment or the combined results 
of all the experiences. 

On a more global scale, after methodological changes are instituted, research
endeavors need to refocus attention if the field is to benefit. Instead of continuing 
the comparisons of treatment methods to determine whether they are equally 
successful on the same problem, it is time to catalog which techniques are most 
efficient for each of a series of problems. Instead of studying the overall effects of a 
given technique on patients with a common problem, it is now time to isolate the 
variables involved in success or failure with that technique. Failures should not be 
filed away as information militating against further use of a technique. Most
modern behavior therapy methods have received basic experimental support; it is 
important therefore to examine negative results as they might provide guidelines 
for making such therapy techniques more effective. One type of information so 
gained might be through reanalysis based upon further empirical data; shock 
applied to the covert cues prior to stuttering may be found to fail, but shock 
applied to the overt act of stuttering may be successful (72). Another approach to 
profit from failure might be reanalysis with better behavioral analyses; common 
reinforcers might not affect an autistic child because self-stimulation is in itself a 
powerful reinforcer, and a greater magnitude reinforcer is needed to alter such 
behavior (197). 

Instead of combining patients with common overt symptoms, it is time to be 
more precise about behavioral antecedents which would require different inter- 
ventions. For homosexuals, anxiety-reduction techniques might be used where 
heterosexual anxiety is prominent and heterosexual skills are already available in the
patient’s repertoire (195), while anticipatory avoidance, controlled fantasy, or 
fading procedures might be appropriate where deviant sex object preference is the 
predominating feature (95, 218). The interaction between patient behavioral 
characteristics and treatment modality can appear in another way. Patients with 
high frequency desirable responses may require tokens to maintain these re- 
sponses, while patients lacking such responses in their repertoire may first require 
reinforcers to shape or initiate such responses (4). Self-modification techniques 
may eventually prove to be more efficient with nonhospitalized clients who al- 
ready have self-control behaviors in their repertoire. For others, treatment might 
first be aimed at shaping self-control as a learned “drive” even before attempting 
to train the client in self-control techniques (193). 

Instead of accepting previous explanations for problems, it is time to develop 
better behavioral theories. Some progress is being made in such areas as behavioral
analyses of aggression (16), depression (66), self-injurious behavior (11),
obsessions (277), and excessive drinking (259). In the area of behavioral deficits, it 
is time to develop more precise behavioral definitions of the target adaptive 
behaviors. As an example, step-by-step behavioral objectives or behavioral se- 
quences have been defined for assertiveness (183, 284). More work is needed 
regarding such definitions of various adaptive behaviors, as well as integrating 
training in adaptive skills into treatment packages. Removal of anxiety may 
permit one client to then emit available adaptive responses, but another client may 
require training or rehearsal of such responses. Diagnostic and prognostic indices 
need to be better understood. The studies of Borkovec (43) and Watson & Marks 
(355) are tentative but promising along these lines. 

Finally, it is time to extend research for longer follow-up periods instead of 
limiting reports to single posttesting. Relapses and failures of transfer of learning 
occur only too frequently for some approaches on some problems. Research might 
attend more to variables associated with preventing relapse and promoting 
generalization, such as overlearning, intermittent schedules of treatment, random 
cueing, lengthening of exposure times, multiple modeling, introduction of covert 
responses in behavioral chaining, imagery rehearsal for transfer, and training in 
principles of self-management. 

A quick glance at articles published over the years in the Annual Review of 
Psychology shows that many authors start out with disclaimers about the possibi- 
lity of completing the task in the space assigned them by the editors. We are in this 
tradition. The editors asked for a chapter on Prevention of Mental Disorders. The 
review was to be accomplished in 25 pages (or less). Obviously, an impossible task. 
Preventive efforts have been divided by Caplan (71) into three separate major 
areas: primary prevention, secondary prevention, and tertiary prevention. As in 
conventional medicine, primary prevention comprises the steps taken to prevent the 
occurrence of a disease, secondary prevention is early treatment of the disease 
once it has occurred, and tertiary prevention is the attempt to minimize the 
long-term effects of the disease. 

Recognizing the impossibility of the task of reviewing this tripartite field, and 
with the agreement of the editors, this chapter will be limited to an examination of 
primary prevention. Even here the literature is so enormous and amorphous as to 
defy adequate coverage. Our reading for a year leads us to the conclusion that 
practically every effort aimed at improved child rearing, increasing effective 
communication, building inner control and self-esteem, reducing stress and pol- 
lution etc—in short, everything aimed at improving the human condition, at 
making life more fulfilling and meaningful—may be considered to be part of 
primary prevention of mental or emotional disturbance. Legislators, social re- 
formers, radicals, conservatives, and practically everyone else, propose solutions 
to human ills. The law requiring seat belts in automobiles which must be fastened 
before the car can function will clearly reduce the incidence of brain damage in 
thousands of people in the course of a year. Enforcing lower speed limits during 
the energy shortage has reduced the number of accidents and auto fatalities. 
Clearly this is primary prevention of emotional distress in persons not injured and 
in families of nonvictims. 

1 We wish to thank J. Greene, M. Solan, I. Turcot, and L. Weiger for their valuable 
assistance. 

The plan of this chapter is to provide a few guideposts through the maze of the 
vast literature on prevention. We start with definitions, or preconceptions, that 
people in the field bring to the area. Then we will examine some of the important 
models of the conditions that have been the targets for primary prevention. We 
will go on to discuss epidemiology and some of the social conflicts that may be 
generated by action programs. We will attempt to point out where efforts at 
primary prevention are being made and what the future may hold. Through all of 
this we will try to show the difficulties for scientific study inherent in the field, and 
we will make some attempts at clarifying the research problems. 

The field of primary prevention is analogous in many ways to the great Oke- 
fenokee Swamp. Attractive from a distance and especially from the air, it lures the 
unwary into quagmires, into uncharted and impenetrable byways. There are 
islands of solid ground, sections of rare beauty, unexpected dangers, and violent 
inhabitants. If one explores, and survives, the area becomes compelling, even 
addictive. Meanwhile, the rest of the outside world goes on unconcerned and 
unaware. As we undertook the task of exploring this vast area we became more 
and more conscious of its size and of its undisciplined nature and the vagueness of 
the existing maps. Like many explorers we became a little crazy, somewhat 
monomaniacal. After this heady experience we still remain enchanted, although 
not overanxious to make the same trip again soon. This analogy, too, soon fails to 
represent reality. When reading through the literature on prevention we kept 
thinking of Raimy’s (270) definition of psychotherapy as an “Unidentified tech- 
nique applied to unspecified problems with unpredictable outcomes”, ... for 
which long and rigorous training is required (p. 3). In comparison with the field of 
primary prevention the field of psychotherapy is tough-minded and exact! 
Wherever we looked, whatever we read or heard, there were unexplored or 
unresearched hypotheses about primary prevention. 

What is meant by primary prevention in the field of behavioral disorders? Most 
authors in this area have wrestled with attempts at definition, interspersed with 
more or less optimistic hopes for the future of primary preventive efforts (11, 28, 
29, 33, 43, 45, 51, 73, 79, 86, 92, 94, 97, 106, 112, 124, 131, 134, 137, 140, 143, 151, 
176, 177, 188, 201, 205, 206, 213, 215, 219, 220, 244, 262, 273, 274, 282, 283, 293, 
294, 301, 303, 325, 329, 341, 342, 358, 368, 371). While Caplan (e.g. 69, 72) has 
written much recently to advance his own model of primary prevention, the 
mental hygiene movement has been around for a much longer period of time than 
he, getting a tremendous impetus in the early part of this century (e.g. 43, 75). We 
are still experiencing a backlash against the extravagant claims of the “mental 
hygiene movement” started by Clifford Beers and others early in the century. This 
movement promised to eliminate most forms of mental and emotional distress 
through education—education of parents, education of children, teaching sound 
mental health principles to teachers, clergymen, and other significant influencers 
(99, 203, 211). 

Currently there are several varying usages of the term primary prevention. A 
sampling of the terminology used in the literature may give an indication of the 
lack of consistency in the field. Wagenfeld (344) sees primary prevention as taking 
two basic forms: (a) intervention in life crises of individuals, and (b) altering the 
balance of physical, social, cultural, and psychosocial forces in the community. 
This is close to Caplan’s position (e.g. 70). He says the primary prevention of 
mental disorders has a long-term goal of assuring continually adequate physical, 
psychosocial, and sociocultural supplies, which both enable the individual to 
avoid stress and increase the individual’s capacity to withstand future stress. 
Definitions of primary prevention sometimes tend to be quite general. For exam- 
ple, Zax & Cowen (374) say that virtually all that is done to improve the lot of 
humans should be included. Or the definition can be very abstract. Roberts (283) 
lists three actions as preventive: (a) the removal of the noxious agent; (b) 
strengthening of the host to increase its resistance to noxious agents; or (c) pre- 
venting contact between the agent and the host. Bower (54) defines primary 
prevention as getting children through our health, family, and school institutions 
“smelling like a rose.” Bower (51) has outlined a detailed action-oriented 
framework for accomplishing primary prevention. Without attempting to deal 
fully with his fairly complex model, we note that he believes primary prevention 
can be accomplished through medical, social, and psychological actions which 
must occur during prenatal, perinatal, and later periods of life in those areas of the 
normal social milieu like the school, and in employment settings, and by profes- 
sionals (such as clergy, police, teachers) in institutions affecting people in trouble. 
He gives many examples of normal emotional hazards for which effective 
knowledge of preventive intervention exists and could be brought to bear. In his 
context primary prevention programs are “aimed at persons not yet separated 
from the general population and, hopefully, an intervention specific enough to be 
operationally defined and measured” (51, p. 837). 

Given the large number of definitions of primary prevention, we were tempted 
to follow a suggestion of Wagenfeld (344). He suggested a return to the 
terminology everyone used before Caplan confused matters—namely to discuss 
prevention, treatment, and rehabilitation. Prevention here means the prevention 
of the occurrence of mental disorders; treatment means intervention after the 
onset of the disorder; and rehabilitation means trying to reverse the damage and to 
rebuild the systems that have been disrupted by the disorder. Primary prevention 
is generally applied to groups or made available to everyone. It can be anything 
which prevents the occurrence of disorder. (Our conflict is that we gag on the word 
treatment as reflecting an inappropriate medical model.) 

Problems of ideology often enter into attempts at defining primary prevention. 
There are political, social, and ethical implications when things are done to large 
groups (211). Elaine Cumming (98) divides mental health preventive efforts into 
two main strategies: (a) finding out what causes mental illness, then removing or 
preventing the cause, using malaria as her example; and (b) discovering the 
characteristics of immune or resistant populations and then making everyone else 
resemble them. She was quite aware of the ideological problems that might be 
involved in mental health preventive efforts. She quotes Bleuler who, in talking 
about prevention of schizophrenia, said “the avoidance of masturbation, of disappointments 
in love, of strains or frights, are recommendations which can be 
made with a clear conscience, because these are things to be avoided in all 
circumstances” (98, p. 3). Not everyone today would agree. 

The platforms of political parties, the writings of political theorists from Lenin 
and Mao, to Lincoln and Gandhi, to de Gaulle and Buckley, all claim as their 
major goal the improvement of the human condition, the creation of environ- 
ments that will reduce or eliminate human misery and emotional distress for large 
groups of people and for future generations. The writings of the utopians from 
Plato to Skinner promise a more rational, orderly, and satisfying life for all, or 
nearly all. Philosophers, theologians, economists, and humanists have in common 
with politicians, environmentalists, and liberationists the goal of a better and more 
satisfying tomorrow. Few have any published data! 

During the past year we found ourselves constantly writing references and ideas 
on scraps of paper and emptying our pockets each day of notes on the primary 
prevention relevance of children’s group homes, titanium paint, parent- 
effectiveness-training, consciousness raising, Zoom, Sesame Street, the guaranteed 
annual wage, legalized abortion, school integration, limits on international 
cartels, unpolished rice, free prenatal clinics, antipollution laws, a yoghurt and 
vegetable diet, free VD clinics, and a host of other topics. Nearly everything, 
it appears, has implications for primary prevention, for reducing emotion- 
al disturbance, for strengthening and fostering mental health. And anyway, as 
Bleuler said, they are good things in themselves. 


MODELS OF DISTURBANCE 


Our problem then is to review a vast literature having to do with the prevention of 
conditions which are not clearly defined, which vary in rate enormously as a function 
of community tolerance for deviance, which change frequencies with changing 
social conditions, and which may not even exist as identifiable individual defects. 

What order can we bring into the bewildering variety of efforts at primary 
prevention? A number of alternative approaches are available: 

1. We could adopt a developmental scheme, examining efforts aimed at different 
developmental stages—efforts at genetic counseling (e.g. 229), marriage counseling 
(332), pregnancy planning (88, 182), prenatal care (290), good obstetrical care, 
parental assistance and guidance (38, 80, 175, 181), well-baby clinics and early 
identification and intervention (e.g. 53, 97, 272), preschool programs and enrich- 
ment programs (e.g. 14, 236, 318), foster home and group home care (87), early 
school intervention and mental hygiene programs in the schools (e.g. 8, 13, 14, 50, 
55, 93, 94, 336, 337, 346, 356, 378), sex education and human relations groups, the 
variety of crisis programs available (e.g. 16, 122, 154, 189, 223, 224, 309), and so on 
through the life span (15, 44, 152, 157, 159, 242, 339, 347) to the plethora of 
proposed support programs for the aged. 

2. Or we could adopt a more usual scheme which separates the field into broad 
areas of preventive special focus: (a) preventing organic difficulties which cause 
emotional distress such as untreated syphilis, lead poisoning, endocrinological 
malfunctioning, brain injury etc; (b) attending to the psychodynamic problems 
which could be prevented through “mental hygiene education” of parents, 
teachers etc; and (c) dealing with social problems which affect large numbers and 
which can only be prevented through efforts at changing the larger social envi- 
ronment. Sanford (301) gives a clear and concise overview of this particular logical 
division of effort (63). 

3. Or we could examine the separate mental “diseases” or conditions which are 
to be prevented with reference to programs designed specifically for each: schizo- 
phrenia (32, 171, 291, 292), alcoholism (261), manic depressive disorder (87, 
316), existential neurosis (222), juvenile delinquency (134, 135, 266, 286), addic- 
tion to drugs (39, 145, 316), mental deficiency (84, 156, 195, 267, 312), etc. 

4. Or we could argue that the prevention of mental “illness” or emotional 
distress is just a disguised form of the old medical model, that we are really 
distracted by the conventional wisdom of our time, and that we should focus on 
the development of competence, or achievement, self-actualization, and on posi- 
tive mental health (e.g. 173). 

Unwilling to adopt one of these models, and without space to consider them all, 
we will simply draw a rough map of the swamp as we see it—pointing to quag- 
mires, putting in danger signs, and leaving much as terra-incognita in the hope 
that other explorers will be tempted by the lure of the unknown and mysterious. 

In any discussion of the field of primary prevention of disorders it early becomes 
imperative to define reliably the disorders that we are striving to prevent. Pro- 
grams of prevention in our area also talk about the development of positive effects, 
about fostering “mental health” (173). Similar questions of definition can be asked 
about these conditions we are attempting to foster. If we cannot define mental 
illness reliably, how can we prevent it? (62, 355). If there is no such thing as mental 
illness, if there are only problems in living (334), then it may be foolish to talk 
about prevention in the usual sense. If there are such “things” as emotional 
distresses and behavioral disturbances which have antecedent causes, then our 
task is not meaningless at all. But before we become optimistic we must consider 
the ethical problems of trying to prevent “problems in living.” 

If mental disorders are primarily organic (279, 280, 313), if they are caused by a 
discoverable physical defect, the strategy for reducing the number of mentally 
disordered people then involves public health efforts of the sort that have been so 
successful in reducing the incidence of other widespread diseases (112, 113, 206, 
221). If we are concerned only with prevention of the conditions which fit the 
diagnostic nomenclature of the Diagnostic and Statistical Manual No. 2 (10), per- 
sons who have been labeled schizophrenic, or neurotic, or brain damaged, then the 
problem, while still enormous, is of a particular order of magnitude. But if we 
define as emotionally disturbed those additional people who are prejudiced 
against other people because of their religion, age, or sex, then the number of 
disturbed people grows dramatically and the dimensions of the effort at prevention 
grow apace (1, 5-7). 

Bower (51) cautions us: 


A common conception of prevention often obfuscates thinking and action, namely, that 
little can be accomplished short of major social overhaul. Prevention of mental and 
emotional disorders is seen as the exclusive result of the abolition of injustice, 
discrimination, economic insecurity, poverty, slums, and illness. To seek less is to 
attempt to fell a giant sequoia with a toy axe (p. 833). 


Different behavioral models have led to the choice of different areas of investi- 
gation. The organicists frequently look at the molecular level for their explana- 
tions of behavior. Research has established the effects of genetic factors (103, 160, 
295), nutritional factors (240), damage to the various parts and levels of the central 
nervous system, and other organic conditions which have measurable subsequent 
behavioral consequences (88, 248). 

There can be no socially oriented prevention without the acceptance of psy- 
chological determinism. Psychological determinism holds that all behavior is 
caused by antecedent interpersonal conditions and events. The scientific problem 
is to identify the antecedent-consequent pattern of relationships in order to be 
able to explain, predict, and control events (192, 225, 276, 288, 289). 

Many psychologists have focused on more molar events, particularly the ex- 
periences called conditioning and learning (e.g. 259). There is no question but that 
human behavior is changed and modified as a consequence of experience. Some- 
times both of these approaches are brought together into interactional studies. For 
example, there seem to be critical periods in the maturation of the human organism 
when exposure to particular learning experiences has the most definite and the 
strongest effects (see 306 for bibliography). When the organism is ready it can 
learn, but no amount of practice before this time has much effect, and practice 
subsequent to the critical period (306) has lesser effect. Proponents of the “psy- 
chogenic hypothesis” have argued that the most critical period for the learning of 
emotional reactions is during infancy and early childhood. This hypothesis sug- 
gests that the formation of adult personality is very largely the consequence of the 
emotional conditions during these early formative critical periods. This obser- 
vation is both obvious to most clinicians and difficult to prove scientifically. 
Clinicians tend to accept it unquestioningly, as a result of their experience, and 
scientists tend to question it because it is unsubstantiated by unambiguous re- 
search findings (179, 247, 280, 372). 

Not all investigators accept the psychogenic hypothesis. For example, Rimland 
(280) says “I predict that research will ultimately show psychosocial influences to 
have minor—if any—relevance in causing the limited disorders called ‘neuroses’ 
and even less relevance in causing severe disorders known as psychoses” (p. 704). 
Hebb (155) supports this position, “there are no data to prove that... there is a 
class of ‘functional’ mental illness that is produced by emotional disturbance 
alone.” Frank (125) concluded that no factors of parent-child interaction could be 
found to be unique in any diagnostic groups, nor to distinguish these groups from 
controls (p. 191). 

Much of this iconoclasm will sound strange to clinicians with the social- 
developmental-dynamic orientation. Every graduate student in psychology has 
been exposed to the “classic” works of people like Spitz (331), Goldfarb (138, 139), 
and Bowlby (58). The message transmitted by these different investigators has to 
do with the importance of early experiences of “mothering,” of the affectional ties 
between infant and caregiver, and of the terribly damaging later consequences of 
inconsistency, of placing infants in institutions and other settings where their 
affectional needs were not satisfied. Recently a widely quoted study by Heston 
(158) presented another side of the picture. Heston found that when infants who 
had been born to schizophrenic mothers were placed for adoption within a matter 
of days after birth, their subsequent rate for schizophrenia was considerably 
greater than the rate for a control group born to nonschizophrenic but emotionally 
disturbed mothers in the same institutions. Both kinds of studies have been 
criticized for methodological deficits and both must be viewed with some caution 
because, as is so often the case, the respective investigators believed they would 
find the results they did. 

It should be noted that critics of the psychogenic hypothesis do not say that 
childhood experiences do not cause later problems; rather they say that the 
relationship has not been proved. One has the feeling that they believe that organic 
explanations ultimately will be sufficient. If this were really true, our task would be 
easy, for it is in the complex area of social, cultural, and familial causation that we 
have our greatest confusion and complexity. In order to substantiate the “psy- 
chogenic hypothesis” it would be necessary to demonstrate with a high order of 
reliability a clear relationship between specific and objectively identifiable early 
forms of emotionally distressing experiences and subsequent behavioral distur- 
bances. It would also be useful to show that most persons whose early experiences 
were free of such disturbances developed into normal adults. It would also be 
desirable to show that persons with exceptionally warm and supportive early 
experiences were, as a group, relatively successful and creative in addition to being 
free of pathological symptoms. Because of the enormously complicated problems 
of stretching a research effort over the long years between the possible cause and 
the later effects, clear relationships are difficult to establish with scientific pre- 
cision. This difficulty is even greater than in other areas of prevention in medical 
epidemiology, because of the problems of definition described above. 

There are a few longitudinal studies of humans, and many with animals, which 
seem to support conventional clinical observations. Thus, for example, Bayley (25) 
reports strong negative association between maternal hostility during the early 
years of the child’s life and measured intelligence more than 30 years later, and 
positive association between ratings of maternal affection during the early years 
and intelligence in middle age (26). There are other studies which deal with 
various psychological and social factors in the development of mental distur- 
bances which are relevant to the psychogenic hypothesis (20, 66, 102, 104, 119, 125, 
126, 141, 161, 165, 166, 169, 170, 191, 232, 246, 251, 260, 275, 286, 306, 326, 353). 

The major strategies of primary prevention depend for their relevance and 
effectiveness on the truth of this proposition—that early happenings have later 
consequences, especially that emotional damage to the child is reflected in its adult 
disturbance. There is no argument in the organic area. Organic damage is the 
easiest to measure. Thus lead poisoning from the ingestion of lead paint, for 
example, produces measurable changes in the blood and damage to the brain, 
with long-term consequences for the child’s ability to learn and to behave nor- 
mally (214). German measles during the first 3 months of pregnancy can produce 
irreversible retardation in the child. Chromosomal anomalies can lead to Down’s 
Syndrome and to other structural mental diseases. Blood changes, chromosomal 
anomalies, disturbed reflexes etc are all objective and measurable in the infant and 
young child. They usually are associated with conditions that bring the infant or 
young child to a medical setting where records are made and observations are 
quantified and stored (250, 251). Later investigators can go back and find these 
objective records and correlate them with current problems. 

The situation gets much more complicated when we search for relationships 
between social experiences during early life and later behavior. These relationships 
are more difficult to establish because of the more difficult problem of 
measurement. In the case of psychological influences—social experiences and 
events—no such permanent records are likely to have been made. The ambivalent 
alternation of affection and rejection by the mother, the child’s experience of being 
left frequently with a negligent baby sitter, the effects of the sudden withdrawal of 
love and attention at the time of arrival of a new sibling—where are these events 
recorded, and how are they retrieved? Three recent volumes on life history re- 
search in psychopathology go a long way toward summarizing the scattered 
evidence of these and related relationships (276, 288, 289). 

Yet much of our clinical experience suggests the powerful long-term effects of 
damaging emotional experiences which have occurred early in the life of individ- 
uals. Almost inevitably the evidence is less reliable than the evidence for organic 
factors. In the absence of well-documented longitudinal studies with follow-up 
observations, we must rely on retrospective studies with all of the methodological 
problems they entail. 

The psychogenic hypothesis seems often to elicit strong feelings of opposition 
from those hardnosed scientists who like to deal only with “facts” and objective 
measurements. The evening that Freud announced to the Viennese Neurological 
Society that all adult neuroses, in his experience, could be traced to early sexual 
trauma, the reaction of his audience was either skeptical or hostile (178). Audience 
reaction continues to reflect these feelings. As evidence of the weakness of the 
psychogenic hypothesis the organicists point to the fact that some children from 
the most emotionally deprived and difficult backgrounds often grow up to be 
apparently well-adjusted and well-balanced adults. It is interesting in this respect 
to note the fact that only one person in 20 with untreated syphilis develops the 
mental illness we call general paresis. This low rate in no way invalidates the 
certainty of organic connection between syphilis and paresis. 

Clearly, the “psychogenic hypothesis” held by a majority of present-day 
clinicians seems to have had its primary origin in the work of Freud. Everyone who 
has studied Freud’s work is familiar with his attempts at relating adult disturbance 
to childhood experience. Yet Lemkau (204), making a careful search of all Freud’s 
writings, including writing to a number of Freudian scholars, searching for some 
indication of Freud’s specific ideas on prophylaxis, concluded that Freud made no 
specific mention of the subject. 

Lemkau (204) does attempt to summarize Freud’s work as it relates to primary 
prevention with propositions such as the following: All behavior is caused and 
causes may be modified in such a way that undesirable behavior is avoided; 
emotional reactions follow a maturational process which is orderly and predict- 
able, and to some extent controllable; while human development involves stress, 
and excessive stress is unfortunate, consequences could be prevented by the 
control of excessive stress; personality is markedly influenced by emotional 
relationships particularly with parents; and finally, cultural factors have signifi- 
cant effects on the individual’s adjustment. While Freud never examined the 
specific implications of these relationships for prevention, a great many persons 
influenced by his work have attempted to apply these principles in the area of 
prevention (301). 

Sanford (301) discusses those followers of Freud, especially his daughter, Anna 
Freud, who have been interested in the application of psychoanalytic knowledge 
to prevention. The coincidence of social cataclysms, the Russian Revolution, the 
Great Depression, with the development of analytic theory, resulted in a school of 
socially oriented analysts. The most notable instigator of psychological and social 
theories relevant to prevention was Alfred Adler. Adler (3) argued for the de- 
velopment through education of social responsibility (also see Ansbacher & 
Ansbacher 12). 

But psychodynamic theories are not the only source of the psychogenic hy- 
pothesis. Darwin (see 75) had shattered the illusion of the fixity of the species, 
advancing the idea of the modifiability of every species including the human. 
Clifford Beers wrote his book in 1908, and the mental hygiene movement sprang 
out of the ready zeitgeist. William James and Adolf Meyer were influenced by 
Beers, influenced him in turn, and helped spread the idea that “mental disease” 
could be prevented if the human environment was improved. 

Bloom (43) quotes Meyer as follows: “Communities have to learn what they 
produce in the way of mental problems and waste of human opportunities, and 
with such knowledge they will rise from mere charity and mere mending, or hasty 
propaganda, to well balanced early care, prevention and general gain of health” 
(p. 1). 

The point is that efforts at primary prevention must stress the temporal order of 
social cause and effect. If an effect is to be prevented, experiences that precede the 
phenomenon must somehow be controlled in order to prevent its occurrence. In 
the field of mental health, one of the major clinical hypotheses involves the 
long-term effect of infant’s and child’s experiences on adult personality and the 
causation of personality disturbance. If the evidence is accepted that certain 
experiences in early childhood have positive or negative effects on adult func- 
tioning, then efforts at prevention should attempt to maximize the early positive 
experiences and minimize or eliminate the negative experiences. Evidence supporting 
these relationships, we believe, is so voluminous as to defy summarization. 
Both in infrahuman and human subjects careful research studies have demonstrated 
the power of early positive and negative experience on adult experience 
(note the earlier references). 

Important sources of documentation of the “psychogenic hypothesis” have 
come from both human and controlled animal laboratory studies (e.g. 61, 192, 199, 
335, etc). The work of Harlow (153) is well known in this context. Severe and 
pathological adult behavior follows from the absence of adequate mothering of 
monkey infants; lack of love is permanently damaging without the restorative 
effect of peer group play. Such laboratory studies are difficult to duplicate in the 
human arena. One of the most convincing was the observation of Skeels 
(321-323), who documented the remarkable and dramatic improvement in a 
group of very young children who were labeled retarded and who made dramatic 
and unexpected intellectual and emotional progress when “adopted” by a group 
of older retarded girls in an institution in Iowa. Most of these children, Skeels 
showed, reached a normal level of intelligence and, on being placed in adoptive 
families, grew up to be normal, healthy, emotionally stable adults in contrast to 
those left in institutions. 

The important point is that unless the “psychogenic hypothesis” is an accurate 
reflection of the true state of affairs there is no basis for efforts at primary 
prevention which focus on the social and emotional experiences of children. To 
argue that the nature and quality of early experience has little effect on the 
development of adult personality, and especially disturbed personality, would 
make unnecessary all of the elaborate attempts at protecting infants and young 
children from psychological damage. Orphanages and foundling homes have 
been replaced by more carefully designed systems of foster home placement of 
children. Attention to the psychological and emotional needs of children is based 
on repeated clinical reports of the damage done by cold and impersonal handling 
(e.g. 57, 331). 

On the other hand, if we do accept the psychogenic hypothesis, then efforts at 
primary prevention, in addition to being concerned with prevention of organic 
and physical damage, must also be directed at the social and emotional experience 
of children. This latter effort will lead social scientists directly into social and 
political action in the attempt to shape social conditions in ways that directly or 
indirectly affect the security and emotional responses of parents and children. 

Both approaches to primary prevention (preventing or eliminating organic 
damage due to defective genes, disease, toxins, and accidents, and prevention of 
damaging emotional experiences) ultimately lead to some interference with indi- 
vidual freedom. In the extreme, primary preventive efforts might be expected to 
lead eventually to social control of child bearing, with licenses issued only to 
persons genetically sound, to prenatal tests which could result in therapeutic 
abortions, to sperm banks and ova banks etc (33). Similarly, efforts at primary 
prevention could ultimately lead especially to nursery and child rearing insti- 
tutions where the negative effects of emotionally immature parents could be 
avoided. 

Perhaps the most compelling reason for continuing the efforts of prevention, 
given a lack of coherent understanding of what it is that we are really trying to 
prevent, is cited in Bloom’s paper on the medieval miasma theory (41). Miasma 
theory held that soil polluted with waste products of any kind gave off a miasma 
into the air which caused many major diseases. Bloom goes on to say that current 
theories of prevention may appear to be patterned after the model first proposed 
by the miasmatists. In spite of the groundless basis for miasma as cause, the 
attempts of the sanitarians did have the effect of reducing the rate of infectious 
diseases. Perhaps in our current attempts at social overhaul, sweeping clean with a 
wide broom will be efficacious only because we may inadvertently hit the appro- 
priate dirt. 


EPIDEMIOLOGY AND ECOLOGY 


It is public health dogma that no widespread human disease is ever brought under 
control by the treatment of afflicted individuals. Smallpox was not conquered by 
treating smallpox patients; neither was treatment of the individual the answer to 
typhoid fever, nor polio, nor measles. Every plague afflicting humankind has been 
controlled when discovery of the cause led to taking effective steps to remove it. 
This process is primary prevention. We vaccinate all children and adults so that 
they will not contract smallpox. We give children polio viruses that have been 
killed or attenuated for the same reason, and we give them measles vaccine, and 
we put fluorides into their drinking water to prevent dental caries. 

There are at least two approaches to establish the research base for preventive 
efforts in mental health. The first is the attempt to study the possible biological, 
psychological, and sociological factors in the origins of disorders discussed above 
under the headings of biogenesis and psychogenesis. A second is the attempt to 
study the pattern of distribution of disturbances in the population: epidemiology. 

According to Gruenberg (149), epidemiology “includes a study of the factors 
that determine the patterns of disease occurrence and how the factors can be 
modified to eliminate the disease from the population.” He goes on to say: 


Epidemiological analysis follows a fairly simple pattern. A factor, f, is suspected of 
playing a role in the production of a disease, d; populations having much more f are 
examined to see if they have more d than populations having little f; through inferences 
based on all relevant knowledge and on theories derived from clinical familiarity with 
the disease, the investigators obtain a picture of the chain of circumstances that leads 
to the occurrence of the disease; finally, crucial links in the chain are attacked ex- 
perimentally. The relationships are often highly indirect and highly complicated but 
this general strategy has succeeded in bringing many animal and plant diseases under 
control (149, p. 2). 
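
The comparison step in the pattern Gruenberg describes (do populations with much f show more d than populations with little f?) can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the counts are hypothetical and are not taken from Gruenberg or from any study cited here.

```python
def rate_per_1000(cases, population):
    """Return the number of cases of d per 1,000 persons."""
    if population <= 0:
        raise ValueError("population must be positive")
    return 1000.0 * cases / population


def rate_ratio(high_f, low_f):
    """Ratio of the rate of d in the high-f group to the rate in the low-f group.

    Each argument is a (cases, population) pair.
    """
    low_rate = rate_per_1000(*low_f)
    if low_rate == 0:
        raise ValueError("rate in the low-f group is zero; ratio is undefined")
    return rate_per_1000(*high_f) / low_rate


# Hypothetical counts, invented for illustration only.
high_exposure = (30, 2000)  # 30 cases among 2,000 persons with much f
low_exposure = (20, 4000)   # 20 cases among 4,000 persons with little f

print(rate_per_1000(*high_exposure))            # 15.0 per 1,000
print(rate_per_1000(*low_exposure))             # 5.0 per 1,000
print(rate_ratio(high_exposure, low_exposure))  # 3.0
```

A ratio above 1 shows only that d is more common where f is more common. It does not by itself establish that f produces d, which is why the quoted strategy goes on to inference from clinical knowledge and to experimental attack on links in the chain.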


Obviously this approach to the study of mental disorders has its problems. One 
is the absence of any objective means of identifying the presence of the “disease.” 
It is relatively easy to study the epidemiology of diabetes, or bone fractures, or 
murder, because each provides objective data to be counted. The effectiveness of a 
prevention program can be measured by looking at changes in rates. But if we 
have no reliable way of identifying the phenomenon, the problem of measuring 
change becomes insurmountable (105, 114, 115). Mental illness usually involves a 
social judgment about behavior, a social decision to call the behavior deviant or 
sick. Barbara Wootton (367) has discussed this problem at some length. She says: 


... anti-social behavior is the precipitating factor that leads to mental treatment. But at 
the same time the fact of the illness is itself inferred from this behavior: indeed it is 
almost true to say that the illness is the behavior for which it is also the excuse. But any 
disease, the morbidity of which is established only by the social failure that it involves, 
must rank as fundamentally different from those with which the symptoms are inde- 
pendent of social norms. 


Obviously the rate of mental conditions is not only a function of the frequency 
of the “pathological” behavior, but also of the community’s tolerance for the 
behavior. Thus, San Francisco has been called “the tolerant city” because much 
wider ranges of deviant behavior are tolerated by the citizenry. This would tend to 
decrease the rate of reported disturbance there in comparison with what the rate 
might be in other cities, except perhaps that the tolerance may also serve to attract 
a wider range and larger number of deviant persons to the more tolerant envi- 
ronment. 

Primary prevention may demand various kinds of social action to reduce the 
subsequent rate of disturbance. Paradoxically, and perhaps only in the case of 
mental disorders, increasing community toleration also has the effect of reducing 
the rates. Sexual behavior which is forbidden by law in certain states or countries, 
and which leads to severe penalties if discovered, may well be tolerated in other 
states or countries. Where homosexuality was once punished as a monstrous 
deviation it now has been decriminalized in parts of the world if occurring 
between consenting adults. A newly formed Association of Gay Psychologists is 
working to increase public understanding and tolerance of this sexual choice. A 
recent vote of the American Psychiatric Association has removed homosexuality 
as a disease entity from the diagnostic and statistical manual. (This is a fascinating 
use of democracy rather than science in the description and definition of “dis- 
ease.”) Similar changes in attitudes have occurred toward consumption of por- 
nography, and the result is a change in reported rates of consumption independent 
of a change in the actual rate of the behavior. 

Obviously then one way to reduce the rate of mental disorders in the population 
would be to change the definition of what constitutes mental disorder. We could 
either sharply limit the use of the term “mental illness” to the most obviously and 
seriously disturbed people, or we could broaden the definition to include mild 
disturbances to which everyone is subjected (268). Either change would have 
immediate effects on rates and would be reflected in our epidemiological studies. 
Inasmuch as public attitudes about mental disorder tend to be more tolerant (S. 
Starr in 177) than the views of the experts, it is possible to manipulate data on rates 
by manipulating the groups that make the definition. It is possible to manipulate 
opinions about mental illness through various kinds of instruction, information, 
and personal contact (110, 241, 249). The extensive literature concerning both 
public attitudes and professional attitudes has been reviewed recently by Judith 
Rabkin (268). 
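
The point made above, that changing the definition of mental disorder changes the measured rate without any change in behavior, can be made concrete with a small sketch. Everything in it is an assumption introduced for illustration: the idea of a single numeric "severity" score, the threshold values, and the scores themselves are hypothetical and do not come from any study cited here.

```python
def rate_under_definition(severity_scores, threshold):
    """Fraction of the population counted as disordered when only scores
    at or above `threshold` satisfy the definition in use."""
    if not severity_scores:
        raise ValueError("need at least one score")
    counted = sum(1 for score in severity_scores if score >= threshold)
    return counted / len(severity_scores)


# One fixed, hypothetical population; the behavior itself never changes.
scores = [0, 1, 1, 2, 3, 3, 4, 6, 8, 9]

narrow = rate_under_definition(scores, threshold=8)  # only the most severe
broad = rate_under_definition(scores, threshold=3)   # includes mild disturbance

print(narrow)  # 0.2
print(broad)   # 0.6
```

The same ten persons yield a rate of 20 percent or 60 percent depending only on where the defining group draws the line, which is the sense in which rates can be manipulated by manipulating the groups that make the definition.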

Reviewing the earlier literature on the epidemiology of mental illness, Plunkett 
& Gordon (263) suggested that its value is more potential than actual. Sources 
of data were examined by them and found wanting. The inability of the field of 
psychiatry to develop a meaningful and reliable classification system was iden- 
tified as a major problem. Plunkett and Gordon set forth a preliminary research 
plan for well-controlled field experiments to identify and codify causal factors in 
such problems as suicide, alcoholism, postpartum psychoses, and psychosomatic 
disturbances. Because each of these conditions offers some hope for objective, 
reliable identification, their distribution in the population might be measured as a 
first step toward discovering causal factors. But these authors, both experienced 
epidemiologists, were less than sanguine about our chances for the meaningful 
study of the other so-called mental illnesses. 

In any attempt to assess the success of efforts at primary prevention, one is faced 
with examining programs that are aimed at conditions of uncertain identification, 
and of unknown distribution, in an area where objectivity is lacking or may be 
unobtainable (e.g. 115, 116). 

In spite of the difficulties involved in the epidemiological approach in this area, 
a number of studies have been conducted. Gruenberg (149, 150), Hollingshead & 
Redlich (162), Langner and Srole (197-199, 333), Leighton (200), Pasamanick 
(257), Zusman (381) have all reported epidemiological data on general popula- 
tions. An excellent review and critique of problems in this field is by Miller & 
Mishler (232). 

Epidemiology has also been applied to more specialized populations such as 
elementary school children (83, 127, 133), native Americans on reservations (315, 
316), and of course to many specific disorders. An excellent example of the latter is 
the work of Cahalan (65) on alcoholics. 

One of the earliest epidemiological studies of mental disturbances and mental 
retardation was the “Jarvis Report,” the Report of the Commission on Lunacy and 
Idiocy in Massachusetts, 1855 (174). In this classic study Jarvis was convinced by 
his data of the intimate relationship between pauperism and insanity. Clearly 
there were more insane persons among the poor, and especially among those who 
were paupers. One hundred years later Ryan (297) reported high rates of un- 
treated or neglected disturbance among the poor. Multiproblem families, together 
with the children of the poor, and the elderly had the highest rates of disturbance. 
Most of the recent epidemiological research reports the same finding (e.g. 104, 126, 
162, 167-170, 228, 232, 277, 348, 370). 

The critical problem is “attribution of cause.” Does poverty cause in a direct or 
indirect way the high rate of mental disturbance among the poor? Does poverty 
cause high rates of crime, alcoholism, drug addiction, and related indices of social 
pathology? Or perhaps both poverty and various forms of pathological behavior 
might be due to some common underlying defect. Jarvis (and more recent con- 
servatives) hold the latter view. Jarvis saw poverty as “An inward principle, 
enrooted deeply within the man, and running through all his elements ... and 
hence we find that, among those whom the world calls poor, there is less vital force, 
a lower tone of life, more ill health, more weakness, more early death, a dimin- 
ished longevity. There is also less self respect, more idiocy and insanity and more 
crime than in the independent.” These “diseases” of poverty and lunacy were 
traceable to an “imperfectly organized brain and feeble mental constitution” (114, 
pp. 45, 52-56). It is instructive to remember that the foreign insane paupers whom 
Jarvis considered defective were the immigrant Irish, whose descendants have 
long since disappeared into middle class respectability and whose rates of mental 
disturbance have dropped dramatically in the intervening century. It is instructive 
also to note that each succeeding wave of immigrants to the inner city—the 
Scandinavians, the eastern European Jews, the southern Italians, and most re- 
cently the Chicanos, Puerto Ricans, and Blacks—have taken their place among the 
Class Five poor and have shown high rates of social pathology during the time that 
they occupy the low rung on the social ladder. It seems reasonable to argue that if 
both poverty and a high rate of social pathology were due to some common defect, 
genetic or constitutional, the rates for these groups would not decline precipitously 
as they achieve economic security. Such evidence is difficult to document statistically, 
but circumstantially at least we would argue that the predominant weight of 
evidence supports the position that poverty causes social disability rather than 
both being due to some inherent common defect (298, 299). 

Much of this literature has been reviewed recently by Elizabeth Taylor Vance 
(343). She reviews the widespread efforts in conceptualizing the dimension social 
competence—social disability which she sees as the major continuum for a wide 
variety of syndromes such as process schizophrenia, antisocial behavior, and 
cognitive deficit. In her thorough review she suggests that “there is much evidence 
to suggest that the extrinsic conditions in contemporary life that seem most 
conducive to the development of syndromes of social disability converge in the 
environmental correlates of urban poverty. Social disability would appear to be a 
dynamic and organic outcome of such conditions” (p. 508). 

The clear implication is that individual therapy will be helpless to break the 
cycle. Rather “the traditional activities of the clinician must be modified in the 
direction of greater involvement in the developmental, environmental and social 
context of the individual. On the other hand, an intimate knowledge and concep- 
tualization of human behavior and development should guide the shaping of 
community psychology activity” (p. 508). 

In this regard, primary prevention of mental disorders is frequently claimed as 
“belonging” to community psychology, a new applied specialty field. 

Many persons date the beginning of community psychology with the Boston 
(Swampscott) Conference of 1965 (34). There a group of persons dissatisfied with 
the traditional and reactive role of clinical psychology invented or appropriated 
the term “community psychology” to designate a new specialty more interested in 
prevention than in psychotherapy and rehabilitation. Community psychology 
deals with social systems, programmed interventions, and the planning of envi- 
ronments. It aims at enhancing human competency and adheres to an ecological 
rather than a psychoanalytic model. 
Cowen (92), in his Annual Review article on community psychology in 1973, 
suggested that attempts at splitting review articles in this field into (a) intervention 
in psychopathology, and (b) prevention of mental disorders, leads to all kinds of 
conceptual difficulties. He suggested that eventually community intervention 
chapters might begin to fuse with prevention chapters. He searched for all the 
articles in the Community Mental Health Journal that dealt with “prevention,” 
“preventing,” or “prevent,” and he was frustrated to find that only some 3 percent 
dealt with these topics. In his search of the Coordinate Index Reference Guide to 
Community Mental Health (136) he found only 2 percent of the references clas- 
sified under “primary prevention.” He came to the disheartening conclusion that 
the social-community intervention area has largely ignored primary prevention 
and has failed to develop a research base. 

Ecological variables recently have come to be seen as possible contributing 
factors in the development of mental disorder (63, 107, 108, 128). Epidemiologists 
have long appreciated the complexity of ecological systems producing health and 
disease. Their insights are applicable to some rather serious mental disturbances 
which may not be diseases at all but conditions which are a result of complex 
ecological systems. A characteristic of an ecological system is that disturbance in 
one place and time may set off a complex chain of events which leads to conse- 
quences only dimly and distantly perceived. Thus the spraying of orchards in 
Minnesota with biocides may affect the oyster beds in Louisiana, with long-range 
consequences to the health of children eating oysters from these beds. Many years 
later a senator from Louisiana may die of a coronary, partly because of damage 
done by her childhood eating habits and food intake. But the chain is not so simple 
and direct. The prosperity of the Minnesota apple growers may have stimulated 
the production of beef in Iowa, making pork cheaper in Louisiana during the 
young adulthood of the coronary victim ... etc, etc. Similar complex chains 
certainly exist in the mental health field so that we should not be surprised to 
discover that only relatively modest research efforts have been made, and great 
controversy and disagreement exist, to tease out cause and effect relationships. 
Three recent volumes on life history research examine some of these research 
questions (276, 288, 289; see also 59, 60). 

In the complex “web of causation” is entangled a swarm of effects. Thus 
married persons in general have a smaller chance of developing a mental distur- 
bance than single persons. Elderly persons who are living with a spouse have a 
strikingly lower rate of hospitalization for mental conditions than solitary elderly 
folk (172). Children from intact families have lower rates than children from 
broken homes (254-256, 286). Even more complex societal variables enter into the 
web of causation. Persons living in preindustrial societies reportedly are rarely 
affected with manic depressive psychosis (235, 360), nor are they subject to the 
classical Victorian neuroses that Freud spent his life studying. The sexually repressed 
Victorians, on the other hand, developed high rates of depression and 
symptom neurosis when as children they were taught to inhibit sexual thoughts, 
feelings, and behavior in order to struggle upward into the middle class, taking 
either the route of education or capital accumulation. In the old industrial societies 
like England and the United States the symptom neuroses are declining in frequency, 
but there is a dramatic rise in existential anxiety and the existential 
neurosis (Who am I? What is the purpose of my existence? What does it all mean?). 

Actually a number of social changes that have been adopted without consider- 
ation of their ultimate benefits in preventing mental disturbance may prove to 
have significant mental health implications. The discovery and widespread use of 
controlled methods of contraception should reduce the number of unwanted 
children. As these children often experience rejection and, as a consequence, 
emotional deprivation leading to later behavioral disturbances, their number 
being reduced should reduce disturbance rates. The widespread use of compulsory 
seat belts in automobiles, and the development of other safety devices including 
padded dashboards and inflation devices, should reduce significantly the number 
of injuries to the brain suffered in auto accidents, with the resulting prevention of 
mental disorders associated with laceration and trauma to the brain. Reducing the 
amount of carbon monoxide and lead in the air breathed by children in our cities 
should also have the effect of reducing the amount of cellular damage to the 
central nervous system and its consequences. 

One recent review (76) of ecological stress examined a whole range of genuine 
environmental stresses, such as radiation, laser beams, insecticides, noise pollution, 
air pollution, temperature extremes, driving problems, overcrowding, etc (31, 
36, 85, 194, 245, 359, 362). These all have been shown to affect mental health in 
some way, and therefore their distribution and control should concern those in 
primary prevention. 

We do not know enough yet about all the possible effects of these and other 
contaminants, but the evidence that does exist suggests unfortunate behavioral 
consequences on animals and children. Unhappily, Dubos (107-109) has pointed 
out that the worst of the environmental pollution contaminants are still to come. He 
cites English studies which demonstrate an apparently permanent malfunctioning 
of children reared under high pollution conditions. 

While psychologists have been interested in the study of stress for many years, 
the last decade has seen an increased concern with an ecological approach which 
examines the complexities of interacting factors in the environment inducing 
stress in the individual and thereby affecting mental health (141). Primary pre- 
vention in the mental health field ordinarily connotes prevention of psychological 
problems. But a voluminous body of evidence also indicates that the stresses of 
complex societies lead to disorders labeled psychosomatic. For example, a spate of 
recent books examines the relationship between life stress and coronary heart 
disease. The literature on stress inducing hypertension, duodenal ulcers, and other 
physical symptoms is enormous and obviously beyond the space limits of this 
chapter (see 87, 117, 230, 253, 269, 357, 364, 374). 

The new approach to the contribution of environmental factors to the development 
of mental disorders opens yet another alleyway in the maze of factors that 
must be studied and accounted for in attempting to organize the field of primary 
prevention. For those interested in pursuing further studies we can suggest the 
references cited above, as well as the National Institute of Mental Health’s 
Bibliography: Epidemiology of Mental Disorders: 1966-1968 (381), which lists some 
170 titles concerning the design and implementation of studies, over 500 articles 
on the etiology of mental disease, close to 400 on rates, and about 250 on the 
characteristics of “diseased populations.” (See also 60, 101, 141, 361.) 


PRIMARY PREVENTION PROGRAMS 


The social-developmental psychological hypotheses concerning the origins of 
mental disorders have led to many programs and proposals that have as their focus 
either broad or specific disorders. These programs range all the way from the 
general education programs of the mental hygiene movement that started with the 
work of Adolf Meyer and Clifford Beers to more specific programs of early 
intervention cited, for example, by Bower, Rolf, Zax & Cowen, Kelly, Garmezy, 
etc (56, 130, 185, 291, 338, 376). 

The question that we must address briefly is how primary preventive efforts are 
to take place. Knowing as much or as little as we do, what is to be done? The 
National Association for Mental Health (238) has prepared one of the most 
specific blueprints for action in primary prevention. It gives actual steps to be 
taken by citizens’ groups in their community to achieve the goal of the prevention 
of later emotional disturbance. The NAMH proposal focuses specifically on 
prenatal and perinatal conditions as areas where definite action should be taken. 
The time from conception to 6 months after birth was chosen as the action period. 
Among the actions recommended are prenatal and perinatal care affecting the 
expecting mother and the developing fetus, ways of coping with life crisis situa- 
tions affecting the mother particularly, the development of better parenting skills, 
the modification of the social system which may be expected to be a source of stress 
if unmodified (welfare, medical care, day care etc), and the insurance of adequate 
medical care for mother and infant following birth. 

There is no dearth of other suggestions concerning how primary preventive 
programs should be carried out (202, 226, 338). Caplan (73), for example, discusses 
those preventive programs which focus on strengthening physical and social re- 
sources: efforts to prevent prematurity; to provide appropriate nutrition to chil- 
dren with a genetic metabolic defect such as phenylketonuria; requiring iodized 
salt in areas where iodine supplies may be inadequate; prevention of psychosis 
due to pellagra; action to prevent lead poisoning in slum children; opportunities 
for cognitive enrichment for preschool children through the educational system. 
Caplan goes on to examine other psychosocial resources and suggests that efforts 
should be directed to the maintenance of a healthy family environment; to special 
intervention at the illness, hospitalization, or death of the mother; provision of 
homemaker services when needed; provision of in-hospital care by the mother 
when the child is hospitalized. He also suggests the application of crisis interven- 
tion as a preventive effort, as well as education of caregivers, mental health 
consultation, personal preparation for healthy crisis coping through education and 
anticipatory guidance (see also, 24, 30, 68, 217, 369, 371). 

Bloom (43) suggested three action tactics: crisis intervention; anticipatory guidance, 
including, for example, meetings between clergymen and engaged parish- 
ioners, between social workers and groups of new parents; and consultation. 
Eisenberg (111) called for preventive programs to include: family planning, good 
health care and decent housing, adequate unemployment compensation and job 
training for displaced workers, case work service to minimize family breakdown, 
effective substitute care for homeless children, enriched school programs, recreational 
facilities, vocational training, and other services. 

The National Institute of Mental Health (239, 241, 243, 244) has proposed a 
program of mental health prevention which is educational rather than clinical in 
conception, with a goal to increase the individual’s capacity for improving the 
quality of his life. It calls for use of a developmental approach, including efforts to 
improve early child development and to increase parental competence, then 
focusing on mental health aspects of public school education and on adolescence. 
It also considers special needs of high risk groups including high risk pregnancies 
with better prenatal care. Socialization training, development of competence, and 
improved self-esteem are included in the plan. 

Kelly (184, 185) talks of three types of therapeutic programs: (a) the clinical 
approach focuses on changes in individuals or small groups and radiates its 
effects to others as one of the indirect results of the services offered; (b) systematic 
changes in organizations are aimed at helping the organizations deal with future 
crises; (c) community organization techniques focused on mobilizing community 
resources for community action. Kelly gives examples of each of these approaches. 

Poser (265) suggests an approach to prevention based on the principles of 
learning. He suggests that mechanisms involving remedial learning techniques 
such as desensitization, flooding, and aversive conditioning be used to immunize 
subjects against later encounters with crisis. He suggests that primary prevention 
might best be achieved by the deliberate exposure of susceptible high risk individ- 
uals to learning experiences that are spontaneously encountered by most normal 
individuals. 

Smith (327) suggests that a real attempt should be made to identify those 
inevitable crisis or life stress events viewed as critical to the mental health of 
individuals. Then we should help or teach the individuals to deal effectively with 
those crises. Early intervention at the school or preschool level is really geared 
toward the prevention of disorders which occur later in life: 10, 15, even 20 years 
after the time of the intervention. Evaluative programs should engage in such 
long-term follow-up in order to ascertain the effectiveness of the program study. 
Unfortunately, this kind of long-term follow-up has not been characteristic of 
most studies in the mental health field. Exceptions are the Berkeley Growth 
Studies (25, 26), Project Talent, and the studies cited in Rolf et al (276, 288, 289). 
Future preventive approaches to mental disorder must provide the necessary 
long-term funding for follow-up measures. 

The above are just a sample of the discussion in the literature of general 
strategies for the prevention of mental disorders. What about more specific pro- 
grams? We will mention some representative current approaches to the prevention 
of mental disorders, without attempting to evaluate the adequacy of these 
approaches. 


PROGRAMS 


The aim of preventive techniques in social planning is to use the social system to 
strengthen adaptive behaviors. Preventive approaches have been attempting to 
use major components of the social system, including the family, the schools, the 
world of work, and the community. 

There have been a number of fairly specific targets for preventive efforts. For 
example, the family has been a focus, including attempts at parental education, 
early family intervention as preschool intellectual training (e.g. 21, 47, 78, 80, 120, 
272, 284, 319, 320), work on training parents as therapists (e.g. 305), programs in 
the school (49, 52, 90, 91, 93, 94, 96, 97, 144, 304), programs in compensatory 
education such as the Headstart Program and the special programs for low income 
and minority groups (81, 82, 86, 90, 181, 189, 258, 278, 304, 381), and for the 
community (2, 9, 27, 35, 118, 121, 148, 164, 183, 189, 190, 212, 218, 237, 264, 281, 
311, 328, 349, 352, 354). 

One major focus for attempts at intervention and prevention is at an early age 
with children. This area has been given special attention in the recent past. Both 
the Group for the Advancement of Psychiatry (147) and the Joint Commission on 
the Mental Health of Children (176) have focused on this important area and have 
pointed to the serious lack of mental health programs, not only primary but 
secondary and tertiary programs as well, for children. There are a number of 
programs aimed at children (4, 17, 37, 40, 61, 74, 129, 142, 175, 181, 193, 272, 307, 
317, 318, 340, 345, 351). Bolman reviews a number of these current programs (46). 

The important role of the school in the prevention of mental disorders, and the 
promotion of competence, has been discussed for a number of years (61, 163, 176, 
177). There is a growing number of programs in school settings to develop the 
competencies of school children and to prevent the educational and learning 
difficulties that can lead to further emotional and social maladjustment. It is
worth emphasizing that some of these programs are not, in the main, geared
towards the prevention or amelioration of emotional disorders. They are in many 
instances geared toward the development of competence in the individuals. Many 
authors feel these are interdependent (13, 72, 89-97, 132, 144, 180, 216, 300, 373, 
375, 376). 

Bolman has reviewed programs for the family, and general programs (47, 48). 
Other program reports include those by Rees (271), Robertson (284), Ryan (296), 
the Skeels and Skodak studies (321-324), and that by Slobin (325). There is some 
question about whether crisis intervention and suicide prevention are primary or
secondary prevention. Rather than go into a detailed review of these areas we list a 
number of references for those who are interested in this approach to preventive 
intervention (18, 19, 100, 122, 154, 189, 208-210, 223, 233, 285, 309, 310, 314, 315, 
350, 363). 

A major question to be addressed concerns the efficacy of primary preventive 
programs. Do they work? Much has been written about the evaluation of primary 
preventive efforts (42, 123, 130, 207, 231, 234, 252, 287, 288, 291, 308), and these 
references should be consulted for detailed information concerning evaluation 
research. The work of Campbell & Stanley (67) stands out as one of the definitive
works in the field of evaluative research. It takes into account the realistic constraints
that are found in doing action-oriented research in a number of fields, and
its guidance is quite applicable to the field of primary prevention.²


SUMMARY AND CONCLUSIONS 


Primary prevention in many areas may require social and political changes to 
improve the “quality of life” (302). Bower and others suggest that a lack of 
progress in the area of prevention may be traced to specific social resistances to the 
changes that would be required to actually make a difference. A major source of 
resistance is to the idea that real prevention must make major social system 
changes. It is true that many observers of our social pathology argue that patchwork
solutions will not do, and that the whole structure of our polluted, industrialized,
overpopulated, overenergized, overcrowded, sexist, and racist society
breeds such massive human injustice and distress that the only hope for preven- 
tion is for major social reorganization. To prevent mental and emotional disor- 
ders, it is argued, we must abolish such injustices as unemployment, bad housing, 
social discrimination, personal insecurity, and poverty. As a consequence of 
the threat all of this holds to the status quo, the Establishment does little to 
encourage or support efforts at primary prevention in the social sphere because it 
believes, with some justification, that it would be funding programs aimed at a 
major redistribution of its power. Money is generally available for biomedical 
research on the search for the individual defect, but social programs are more 
often suspect (6, 186, 187). 

A second force operating to resist efforts at primary prevention is
our societal commitment to privacy and to personal freedom of choice, “the right 
and privilege of each person, and family, in a free society to mind his own business 
and have others mind theirs” (51). Even such relatively simple matters as the
fluoridation of the water supply, and the requirement of vaccination against
polio and smallpox—clear as is the research evidence which supports these prac- 
tices—elicit widespread resistance and public debate. To urge even more drastic 
invasions of privacy, to propose the more emotion-laden kinds of intervention 
believed by many to have profound preventive implications for mental health, is 
to call out even more violent resistance. Sex education, growth groups, con- 
sciousness raising, parent effectiveness training, day-care, etc are programs that 
make many members of our society tremble with anger (64). 


2 Since completing this chapter, we have seen additional reviews of the field of primary 
prevention which cover the subject, or parts of it, in much greater detail than we have been 
able to do in the space available to us. The interested reader is referred to these publications: 
(a) R. F. Muñoz and J. G. Kelly. The prevention of mental disorders. Programmed
Learning Aid Series, ed. C. Cofer. Richard Irwin Co. In press. 

(b) J. Monahan 1974. The prevention of violence. In Community Mental Health and the 
Criminal Justice System, ed. J. Monahan. New York: Pergamon.

(c) A. Binder, J. Monahan and M. Newkirk. Diversion from the juvenile justice system and 
the prevention of delinquency (same publication).

Society is ambivalent enough about requiring people whose behavior is already
disturbed to undergo compulsory therapeutic procedures. Profound philosophic 
questions are involved in programs like behavioral modification aimed at improving
mental functioning. But society is highly ambivalent about utilizing
some of these same techniques to intervene in the lives of individuals, especially 
children, who have not yet shown any kind of behavioral disturbance. Even when 
the approach is to prevent the development of later antisocial behaviors, as
implied by the work of the Gluecks (134, 135), for example, there is a great
deal of concern over the rights of individuals to be let alone. 

We assume that both childhood experiences and societal rules are important 
determinants of later behavioral adjustment or disturbance. It has been noted that 
two of the most important activities engaged in by members of our society legally 
require no training or demonstrated ability—that of governing and that of raising 
children—and that any laws proposed in either area would be quickly rejected as a
serious curtailment of freedom. 

The explanatory models we use to account for mental disorder have profound 
implications for choosing our strategies for primary prevention. Most true diseases
can be brought under control with relatively little disturbance to the existing social 
order. These are the familiar public health methods. The requirement of
vaccination against smallpox implies some degree of coercion; the treatment of
public water supplies with antibacterial agents to prevent disease, and with fluorides
to prevent tooth decay, is likewise a public health measure adopted without the
individual consent of the people affected. Most rational observers would agree
that such preventive measures involve little threat to the social order. It is when we 
get to efforts at preventing mental disorder and emotional distress that real ethical 
problems begin to appear. To get really involved in primary prevention of emo- 
tional disturbance often suggests social and political change quite far removed 
from the conventional conservatism characteristic of middle class professionals.
Here is another interesting paradox. Those persons most likely to understand the 
relationship between social problems like poverty, unemployment, discrimina- 
tion, and prejudice, and resulting high rates of mental disorder and distress, are 
members of the professions who work most closely with these “mental cases.” Yet 
members of professions, in large measure because of the nature of their professional
training, are selected to be relatively conscience-laden, upwardly mobile,
politically conservative-to-liberal, but certainly not revolutionary, social change
agents. In addition, they tend to be the most informed and therefore the most 
logical ones to be concerned about individuals’ rights. Too often they remain 
silent. 

We can say with Bloom (41) that the most compelling reason for continuing 
efforts at prevention is suggested by the success of the miasmatists. Cleaning up 
the waste turned out to be the way to prevent disease. While elements of the theory 
were wrong the results were right. 

It is tempting to search for a simple formula to cover all of the complexities of 
environmental stress in causing disruptive behavior and emotional problems. It is 
tempting to suggest an extension to the human environment of Lord Acton’s (23) 
dictum: “Power tends to corrupt—absolute power corrupts absolutely.” Everywhere
we looked, every social research study we examined suggested that major
sources of human stress and distress generally involve some form of excessive 
power. The pollutants of a power-consuming industrial society; the exploitation of 
the weak by the powerful; the overdependence of the automotive culture on 
powerful engines—power-consuming symbols of potency; the degradation of the 
environment with the debris of a comfort-loving impulse-yielding society; the 
power struggle between the rich consuming nations and the exploited third world; 
the angry retaliation of the impoverished and the exploited; on a more personal 
level the exploitation of women by men, of children by adults, of the elderly by a 
youth-worshiping society—it is enough to suggest the hypothesis that a dramatic 
reduction and control of power might improve the mental health of people. It is a 
tempting oversimplification, an hypothesis we will not propose seriously, but one 
that we recommend for quiet contemplation. We have found it particularly satis- 
fying under relaxed environmental conditions—in a rowboat on a quiet Vermont 
lake on a sleepy summer day just before the fish begin to bite.

This review is a highly selective coverage of literature. Its purpose is to give a
historical perspective in order to illustrate how early issues in the concept of
schizophrenia have a direct bearing on contemporary problems of investigation
and how, in spite of it all, some promising pathways may be present which can lead
us out of this current problem wilderness in schizophrenia research.

The concept of schizophrenia as we know it today was developed by Kraepelin 
in 1896 and called dementia praecox. Considering forms of insanity already 
described by other clinicians, he defined dementia praecox as a group of disorders 
of psychosis which were characterized by early onset and progressive deteriora- 
tion. “Dementia praecox is the name provisionally applied to a large group of 
cases which are characterized ... by mental deterioration in varying grades” (21, 
p. 219). 

Kraepelin was criticized for his concept primarily on four grounds. First, pro- 
gressive deterioration did not always occur; that is, some patients had remissions. 
Therefore, it was not always dementia. Second, the onset was not always early, i.e.
during puberty and adolescence; therefore, it was not always praecox. Third, 
instead of defining a disorder by the custom of clinical manifestations, Kraepelin 
defined it in terms of the course of the disease. Fourth, the subclasses of dementia 
praecox were not mutually exclusive and did not lend themselves to reliable 
subdiagnosis. 

Kraepelin, besides being a sensitive clinician and lucid writer of modern psy- 
chiatry, was gifted with a flexibility in his thinking. He spent his remaining 
professional years after 1896 sorting out the clinical data and valid criticisms and 
trying to modify and improve the clinical groupings. Of particular interest are the 
positions he arrived at in regard to deterioration, remission, early onset, defining 
symptoms, and subclassification, considered in light of current knowledge. 

Even before Kraepelin’s writings were translated into English, it was clear that 
he acknowledged that progressive deterioration did not occur in all cases. 
“... whether the process is always the same is by no means certain. Dementia fortunately
does not occur in all cases, but it is so prominent a feature that the
name dementia praecox is best retained until the symptom group is better understood”
(21, p. 219).

Regarding remission, Kraepelin, like Bleuler (11) in his 1911 work, recognized
that the personality of the remitted patient was characteristically modified from its 
premorbid state. Thus these cases of insanity could seldom be described as com- 
pletely cured. 

Writing a decade before the beginning of modern developments in the philos- 
ophy of science, Kraepelin was not cognizant of the pitfalls of using deterioration 
or incurability as part of the definition of a disorder. A clinician in Kraepelin’s day 
would designate a patient dementia praecox if insane, if the insanity fit no other 
diagnostic type, and if the onset was early. Strictly speaking, a final diagnosis could 
not be confirmed until the patient had expired with no evidence of remission. Any 
adequate diagnostic system today would separate diagnosis from prognosis. The 
diagnosis would be evaluated for its useful prognostic statements but would not be 
governed by them. 

By the time of Bleuler, it was also acknowledged by Kraepelin that not all cases 
had early onset, such as in puberty or adolescence. Current epidemiological data 
(44) indicate that the mean age of onset and first hospitalization for schizophrenia, 
although not during adolescence, is nevertheless distinctively earlier than that for 
the remainder of hospitalized mental disorders taken as a group. The peak age of 
first admission for schizophrenia falls between 25 and 40 years. After that the 
frequency tapers off. For nonschizophrenic first admissions, the disorders related 
to aging are so prevalent that the frequency curve continues to rise with increasing 
age. Thus Kraepelin is found to be accurate in a relative sense—or else modern
diagnosticians have simply learned it is unacceptable to label a first-admitted 
older person as schizophrenic. 

Regarding the lack of a defining clinical syndrome, Kraepelin did not depart 
from his scientifically careful position. Dementia praecox for him was an arbitrary 
generic grouping of poorly understood psychotic disorders. It was not a clinical 
syndrome or entity. He was willing to admit that variations and exceptions oc- 
curred which did not fit his arbitrary schema. 

Kraepelin changed his scheme of subclassifications. In his published lectures 8 
years later (41), “katatonia” was excluded and considered separately from the 
disorders of dementia praecox. In Bleuler’s (11) text of 1911, he appeared to have 
overlooked this change and responded only to the 1896 version of Kraepelin’s 
nosology. 

The tenuous advances in knowledge since Kraepelin’s time allow us to see his 
views as still valid on several counts. He hesitated to posit a clinical entity. Once 
posited by his successors, it still has not attained a standard of scientific certitude. 
He put emphasis on the formal aspects (or course) of the disorder rather than the 
clinical manifestations. Indeed, our current knowledge tells us that premorbid 
adjustment, speed of onset, and readmission history have proved useful. Clinical 
symptoms, on the other hand, are of less value because they are inextricably 
confounded with the nonpathological factors of personality, socioeconomic status, 
and level of cognitive functioning. 

Still another view of Kraepelin is of increasing importance. This view concerns 
the response to the momentarily presented stimulus. “Ordinary external impressions
are correctly apprehended, the patients being able to recognize their environments
and to comprehend most of what takes place about them. Yet accurate
tests show that very brief stimuli are not well apprehended” (21, p. 222). Recent 
evidence indicates much is to be understood about schizophrenia in the few 
seconds and fractions of seconds following such a stimulus. A major focus of this 
chapter will be on this promising aspect in the assessment of schizophrenia. 

Bleuler (11), whose 1911 text on schizophrenia was not published in English 
until 1950, rejected Kraepelin’s agnostic view. He declared that dementia praecox 
had sufficient common characteristics to be a clinical entity. He renamed the 
disorder schizophrenia to avoid troublesome connotations in dementia praecox. 
He recognized the multiplicity of the disorder as did Kraepelin, but he proceeded 
to point out the primary symptoms common to all groups of schizophrenics. 
Disorders of associations, affect, ambivalence, and sometimes autism were in- 
cluded. 


By the presence of the symptom complex so selected and defined (by Kraepelin) the 
great group of dementia praecox is characterized as a unit... A closer examination 
shows that, in fact, all these cases have much in common, that they are clearly marked 
off from other types of mental disease (11, p. 3). 


... Thus, we are left with no alternative but to give the disease a new name... 
schizophrenia ... For the sake of convenience I use the word in the singular although it
is apparent that the group includes several diseases (11, p. 8). 


In his declaration, Bleuler presented a substantive clinical point of view which 
was readily adopted by other clinicians. However, an apparent contradiction was 
initiated which continues to the current day in textbooks of psychiatry. After
recognizing that schizophrenia is a group of disorders, its description is then 
elaborated as if it were a single clinical entity. To this day there is no firm evidence 
by which to say whether the subclassifications of schizophrenia—either the Kraepelinian
subclassification or its later modifications—represent separate disorders or
whether they represent separate personalities or cognitive styles onto which a
single disorder has been imposed. We do not yet know if Bleuler’s common clinical
entity position is the mistake of the century which has misdirected the under- 
standing of schizophrenia or whether he was six decades or more ahead of his 
time. 

Misguided or not, the door was open for psychologists and psychiatrists to deal 
in assessment problems with more sophisticated and complex clinical content. 
Rather than the simplistic procedure to identify a dementia praecox, described 
earlier, the clinician was now expected to gain from clinical data a more detailed 
understanding of associations, affect, ambivalence, and bizarre behavior. 

It should be noted that this new emphasis on clinical assessment evolved from 
the fact that Bleuler stood between two giants of his day: Kraepelin with his 
nosology; Freud with his analysis of intrapsychic phenomena. Bleuler wished to
build an effective rapprochement between the two. In doing so, the focus upon
associations of distinct and idiosyncratic nature was relevant to both Kraepelin’s
psychosis classification and Freud’s theory and treatment approach. Bleuler drew
upon the work of Semon to elaborate the intrapsychic role of associations: 


In following Semon, Bleuler distinguished (that) associations are formed as a result of 
our experiences. They integrate themselves into clusters which under certain conditions 
can be evoked and be integrated with other clusters. In order to serve our cognitive 
adaptation, the associations must lend themselves to becoming ordered, ‘streamlined,’ 
and hierarchically organized; that is, they must become goal-directed (69). 


Jung, a colleague of Bleuler, developed the word association test as an assessment 
tool. However, it was not until the 1960s, when Bannister and his co-workers (4-7) 
employed the Kelly Rep Test (39), that progress was made in describing this 
interplay of schizophrenic associative clusters. 

The rapprochement attempted by Bleuler was not well received by Freud. 
Freud, while welcoming disciples, did not feel disposed to have his ideas “rap- 
proched.” Consequently, Freud concluded Bleuler was a victim of his own third 
primary symptom: ambivalence (69). 


History from 1911 to 1957 


A brief comment may be made about what happened in work on assessment of 
schizophrenia between 1911 and 1957. The work by Rorschach and later by 
Shakow represented significant steps. The work on subclassification of schizo- 
phrenia and later on the process-reactive (premorbid adjustment) concept are 
worthy of note. 

Rorschach developed and published in 1921 an inkblot test as a tool to diagnose
psychopathology. This tool, for several decades an important diagnostic device, 
was a ready victim of abuse by the stupid and ignorant. The stupid accepted and 
propagated its clinical interpretations regardless of their validity. The ignorant 
rejected its validity out of hand without exerting the effort to learn which indices 
had evidence of experimental and clinical validity and which ones did not. While 
not to be reviewed here, the use of this instrument in assessing schizophrenia may 
be summarized in certain respects: 1. Schizophrenics, and others such as retardates,
psychopaths, and brain damaged, showed poor form quality. 2. Schizophrenics
gave “sick” content of dead, deceased, and impaired creatures. 3. The
frequency of creatures being acted upon rather than taking action was high. 4.
Peculiar (idiosyncratic) associations and displays of affect occurred. 5. Boundary
problems, especially the preserving of body boundary integrity, occurred. 6. Once 
identified as schizophrenic, a patient’s Erlebnistypus (movement vs color and 
shading balance) was useful in predicting the type of symptoms, e.g. hallucina- 
tions vs delusions. For the past the Rorschach perhaps served its greatest purpose 
in providing an entree for the clinical psychologist into the institutional store- 
houses to study psychopathology. For the future, the Rorschach, now becoming 
less used, will probably have made its greatest contribution by pointing the way to 
pertinent visual information processing phenomena which can better be studied in 
a well-controlled perceptual laboratory. 

Shakow and his group at Worcester State Hospital played a significant role in
introducing experimental psychological techniques into the clinical study of psychopathology.
Looking for the simplest levels of behavior where the schizophrenic
deficit appears, Shakow and his group found chronic schizophrenics to be normal 
in automatized (repetitive) motor response, sensory acuity, motor reflexes, and 
steadiness tasks. However, a breakdown occurred in speed of initiating a fast 
reaction to a predesignated stimulus. In effect, the apprehension of the briefly 
presented stimulus, mentioned by Kraepelin, had come back into study. Rodnick 
& Shakow (57) identified conditions under which schizophrenic groups were 
slower than normals without exception (i.e. without overlap in their distributions). 
Later these differences became better understood in relation to the length of 
preparatory intervals (between warning and imperative stimuli) and the unusually 
potent influence in schizophrenics of the preparatory interval in the immediately 
preceding reaction time trial (79). 

As a number of perceptual and motor tasks were examined by Shakow and his 
group, one disappointing type of finding occurred and recurred. Regardless of the 
cognitive or attentional task given, schizophrenics tended to perform more poorly 
and more variably than normals. This was disappointing to clinical investigators 
because it precluded the attractive possibility of explaining schizophrenia in terms 
of a single kind of performance deficit. The Shakow attentional set deficit notion, 
and later the inhibition interpretation offered by Zubin (81), represented the 
major ways by which to explain schizophrenic behavior. 

Among scholars of schizophrenia, Shakow’s experimental behavioral studies 
did not reach a level of popularity which was in proportion to their scientific merit. 
One possible reason was that they were conducted during a time when the ex- 
quisitely phrased psychodynamic interpretation had a greater appeal than cold 
numbers and statistical inference. Also, reductionistic thinking made biochemical 
hypotheses seem more real and relevant than millisecond latencies in a simple 
task. Another reason may be that to this day no schizophrenic has been arrested, 
held in jail, or committed to a mental hospital for having a slow reaction time, a 
strong crossover effect between regular and irregular series, or any of the other 
phenomena which were studied. Only the saying and doing of crazy things caused 
such consequences. Perhaps the time had not yet come when the typical clinical 
scientist could see the connection between millisecond differences in response and 
the infrequent but blatant characteristics of madness. 

Attempts continued during this period to sort out and improve the Kraepelinian 
subclassifications of schizophrenia. Elaborations were made in the Diagnostic and
Statistical Manual of the American Psychiatric Association (1). Factor analyses of
ward and interview behavior were conducted by Wittenborn (75) and Lorr,
O’Connor & Stafford (43). No dramatic consequence has come as a result of this 
line of work. Perhaps it is because neither the clinical psychiatrists nor the factor 
analysts kept in mind that diagnoses, like other scientific constructs, must be 
developed in close relation to the variables with which they are supposed to be 
related, i.e. in this case prognosis and, if possible, treatment indications and 
etiology. In the extreme case, a classification system could be developed from data 
of schizophrenics’ arithmetic competence or some other equally irrelevant variable.
To scrutinize clinically or to factor analyze the behaviors seen often in
schizophrenia does not mean an adequate classification system can be built from 
such an endeavor. 

Many steps were made in clarifying the process-reactive construct in schizo- 
phrenia during this period. Following Langfeld, Wittman (76) developed the 
Elgin Scale to measure it. Independently, Phillips (55), a member of the Worcester 
group who was studying indices to predict successful response to shock therapy, 
published the scale of premorbid adjustment which came to be viewed as con- 
ceptually equivalent to the process-reactive measure. In 1956 a classic study by 
Farina & Webb (24) revealed that regardless of shock or other hospital treatment 
good premorbid schizophrenics had better prognosis than poor premorbid schizo- 
phrenics. Since that time the process-reactive dimension has generally been 
recognized as the one best single indicator of prognosis in schizophrenia. This 
dramatic finding, while very important to current research in schizophrenia, was 
also observed two centuries before Kraepelin. Willis appeared to be aware of it in 
The London Practice of Physick in 1685: 


(Madness) rais’d on a sudden from some solemn evident cause, as from a vehement 
passion, is much safer than invading by degrees. 


... It is a common observation, that men born of parents that are sometimes wont to be
mad, will be obnoxious to the same disease; and that often they have lived prudently 
and soberly above thirty or forty years, yet afterward without any occasion or evident 
cause will fall mad (74). 


A Turning Point: Visual Size Estimation Assessment 


In 1957 a major milestone quietly occurred in the history of assessment in schizo- 
phrenia. Jesse Harris, a graduate student doing a dissertation (34) with Garmezy 
at Duke University, was pursuing a Bruner-Postman-based hypothesis that affec- 
tive evaluative response would influence visual size estimation. Schizophrenics, it 
was felt, in particular would be influenced in evaluating size of threatening and 
nonthreatening mother-child interaction scenes. The hypothesis was tested with 
acute (i.e. early hospitalized) schizophrenics and normals, and no differences were 
revealed. The scale of premorbid adjustment, adopted by Rodnick and Garmezy 
from Phillips at Worcester, was applied to the schizophrenic subject sample. In an 
earlier Duke dissertation by Bleke this scale had been discovered to be useful in 
accounting for schizophrenic subject variance. The result with Harris’ data was 
something which had never previously occurred in the history of schizophrenia 
research. The two schizophrenic groups classified in terms of premorbid history
fell in opposite directions in size estimation level from the normal control group. 
Good premorbid schizophrenics tended to underestimate and poor premorbid 
schizophrenics tended to overestimate visual stimulus size. Normals did not differ 
significantly from veridicality. 

What did these results mean? Premorbid adjustment had already been identified
as a prognostic index relevant to schizophrenia. Was this an indication of two
different types of schizophrenia with opposing abnormalities in perceptual behav- 
ior? Was it an indication that the singular disorder of schizophrenia affects the 
perception of people with different cognitive styles differently—such people also 
differing in ability to get out of the hospital and readjust? The certain and 
immediate outcome was that the article based on the dissertation was accepted for
publication (34) and was largely ignored for a number of years.


Silverman and Venables: Homogeneous Subgrouping 


In 1964 another milestone occurred. In two independent review articles, Silver- 
man (66) and Venables (72) pointed to the importance of having all schizophrenics 
under research study classified in terms of three dimensions: (a) premorbid ad- 
justment (process-reactive); (b) paranoid vs nonparanoid symptomatology; and 
(c) chronicity (length of hospitalization). This was a major but not complete step in
the direction away from considering the Kraepelinian classification system to be 
first and foremost. Only the paranoid-nonparanoid distinction was retained. The
evidence from numerous research studies had made it clear that the great vari- 
ability in schizophrenic behavior was reduced if homogeneous subgrouping (or 
covariance) was carried out with these three dimensions. 

The manner in which Silverman arrived at these conclusions regarding sub- 
classification is of interest. A new emphasis upon scanning characteristics of 
schizophrenics was developed. Being a graduate student at the University of 
Michigan and studying for qualifying examinations, Silverman had read the 
Harris article and was intrigued by its unusual findings. He also discovered the 
early child development research on centration and decentration by Piaget, which 
related eye movement of young children to visual size estimation. Likewise, he 
read the work by Gardner and his Menninger colleagues on cognitive styles. In his 
integration of this material, Silverman concluded that Harris’ data could be 
explained on the basis of scanning and eye movement differences between good vs 
poor premorbid schizophrenics. The former, Silverman proposed, had frequent 
eye movements consequent to a cognitive style of extensive scanning; the latter, by 
contrast, had fewer eye movements consequent to minimal scanning. Silverman’s 
formulation suggested that paranoid and nonparanoid schizophrenics should also 
differ in size estimation level, and his own research proved this true (65). Silver- 
man’s fellow students were impressed by this integration of ideas, not only because 
of the insight it illustrated but also because Silverman stood on a table in the 
student study room as he first described it. 

A number of questions were left unanswered by the new formulations of 1964.¹ 
Thus the heuristic implications were formidable. The case made by Venables and 
Silverman regarding the importance of the three variables was based primarily 
upon studies which took only one or at most two of the variables into consideration 
at a time. No study had taken all three into consideration. No study had compared 
the process-reactive and paranoid-nonparanoid dimensions at the same time. No 
one had looked at scanning in psychotic children. No one had thoroughly studied 
eye movement in schizophrenics; only inferences had been made about it from 
size estimations. 

1 While biochemical studies will not be reviewed here, it is worth pointing out that most of 
them have not taken advantage of the 1964 formulations. Hopefully, the advancement of 
biochemical understandings would profit from these principles also. 

Subsequent studies indicated that among acute schizophrenics (hospitalized less 
than 3 years) good vs poor premorbid adjustment and paranoid vs nonparanoid 
symptomatology were indeed significant and independent (i.e. noninteracting) 
factors influencing size estimation (20). Thus Silverman’s (66) formulation was 
supported. No size estimation effect was found in disturbed acting-out and with- 
drawn-psychotic children, except the already established Piaget age relationship 
for children in general. That is, size overestimation decreased with increasing age 
(20). The phenomenon seen in adult schizophrenics does not seem to be present in 
children. When eye movement on the standard stimulus was restricted to 100 msec 
(a time which would preclude multiple information-search eye movements on the 
stimulus target), the same pronounced differences in size estimation between acute 
process nonparanoids and reactive paranoids were displayed (50). Thus eye 
movement, while remaining as a possible operant, could not be viewed as the sole 
factor which mediates this size estimation abnormality among adult 
schizophrenics. 

So the embarrassing situation faced by Harris was back again: a fascinating 
phenomenon was being observed when people showed pictures to patients, but no 
one had an adequate way to explain it. 


Stimulus Redundancy Formulation 


A tentative explanation called “stimulus redundancy” was developed (15) on the 
basis of theoretical observations by Pribram & Melges (56) in neurophysiology. 
Essentially the formulation held that rates of processing of external stimulus 
information are determined not only by the availability of such information but 
also by controls within organisms. These controls can reduce or shut down the 
external input to produce a condition of stimulus sameness (high stimulus 
redundancy). On the other hand, the processing can be set at a high rate (low 
stimulus redundancy). Rate of processing was proposed to affect the way visual 
size is estimated: high rate, low size; low rate, high size, for any given element 
processed; or stated differently, low redundancy, low size; high redundancy, high 
size. From this formulation, two ways appear available to influence size estima- 
tion, one primarily environmental and one primarily organismic. For size under- 
estimation, choose (a) a highly varied and/or changing field and/or (b) a subject 
who processes at a fast rate. For size overestimation, choose (a) a simple field or 
ganzfeld (stimulus-free surrounding) with a limited number of unchanging visual 
elements and/or (b) a subject who processes at a low rate. Eye movement would 
have an effect so long as the shifting of images to the retina would allow information 
processing rate to vary. 

If such a formulation were to have value, its predictability should be tested in a 
situation different from one on which supporting data was already available. Kar 
(38; see also 16), considering the Müller-Lyer illusion, noted that the processing 
of information from the arrowheads and tails in this illusion figure obstructed the 
accurate size matching of the arrow shafts. Thus it was reasoned that increasing the 
nonrelevant processing would increase the illusion. This was done by adding 
distracting figures and words to the background on the illusion card. Acute reac- 
tive paranoid schizophrenics, alleged to be high rate processors, were expected to 
have increased illusion, while acute process nonparanoids were expected to have 
less illusion. Results gave partial support to the predictions. The background 
distraction, as predicted, greatly increased the illusion for all groups. The reactive 
paranoids, as predicted, displayed much greater illusion errors than the normals 
and the process nonparanoid schizophrenics. The latter two groups, contrary to 
prediction, did not differ. The final clincher came, however, when Kar asked 
subjects after the experiment to recall all the distracting elements they could 
remember. Reactive paranoids recalled more than normal controls; normal con- 
trols recalled more than process nonparanoids. These results on incidental recall 
were clearly in line with the predictions. From this study some promise was offered 
that the redundancy formulation might be useful. 

Clinical features and other research results were then related to the redundancy 
formulation (16). The high redundancy schizophrenics tended to have a gradual 
onset of psychosis, were consequently hospitalized from loss of family or com- 
munity supports rather than from sudden psychotic symptoms, were more often 
nonparanoid (28), were the victims of soft signs of organic brain damage (36) with 
increasing chronicity (71), were tolerant of stimulus deprivation but unable to 
tolerate high stimulus bombardment (49), tended to profit from phenothiazine 
therapy (27, 29), had a more stable course of illness without clear-cut remissions or 
enhanced floridity of psychosis, and had generally poor prognosis (24). The low 
redundancy schizophrenics tended to have a sudden onset of psychosis, were 
blatant in symptoms especially during the early (acute) stage, were prone often to 
remission and/or episodic relapses, were averse to stimulus deprivation, were not 
prone to profit from phenothiazines, and had a good prognosis for spontaneous 
recovery. 

Key questions remained: Does the redundancy dimension have anything to do 
with schizophrenia? Do the different processing rates result from schizophrenia? 
Do the rates reflect different disorders, or the same disorder at different temporal 
points during the progression of the illness or at different severity levels? Or are 
these processing differences a matter of cognitive style or other factors onto which 
a disorder has been imposed? No evidence now exists which will answer these 
questions conclusively. 

One type of evidence which suggests that high and low redundancy schizo- 
phrenics are different disorders comes from biological data. As already indicated, 
good premorbids do not profit from phenothiazine therapy, but poor premorbids 
do (27, 29). Kety et al (40), in comparing adopted acute (good premorbid, quick 
onset), chronic (poor premorbid, slow onset), and borderline (poor premorbid, 
slow onset) schizophrenics found evidence of the disorder in only the biological 
(not adoptive) relatives of only the chronic and borderline (but not acute) schizo- 
phrenics. Thus transmission of schizophrenia was associated with only poor 
premorbid, slow onset patients. These findings suggest that only the high redundancy 
disorder is susceptible to biological transmission. A cloud of doubt was 
placed over the argument that good premorbid schizophrenia is merely a less 
severe version of process psychosis. From these data the borderline condition 
would seem to be the less severe form of process schizophrenia. On the other hand, 
these exciting data are still susceptible to alternative interpretations. That which is 
biologically transmitted may not be a direct etiological factor of high redundancy 
schizophrenia. It may be the transmission of one or more biologic inferiorities 
which add to life stress. If so, the genetic transmission of sporadic shortcomings, 
like deviant family communication (77) on the nonbiologic level, may summate to 
increase the probability of breakdown. 

European diagnostic practices suggest a difference between high and low re- 
dundancy patients. Diagnosis of schizophrenia there is greatly restricted to what 
would be described here as high redundancy schizophrenia. Several sources (45; 
Lennart Ka, personal communication; S. S. Kety, personal communication) 
make it clear that European, especially Scandinavian, definitions of schizophrenia 
tend to restrict or even exclude what Americans refer to as reactive (good premorbid) 
schizophrenia. Evidence of a clear precipitating event, marked affective² 
or confusional symptoms upon admission, and quick onset would be basis for 
ruling out a schizophrenia diagnosis in Sweden. Patients with good premorbid 
adjustment would be called schizophrenic retrospectively only if they continued a 
steady course of psychosis without remissions or episodes. In the United States- 
United Kingdom study (32), Zubin’s group conducted an intensive survey of these 
diagnostic differences. Consequently, European readers should be reminded that 
findings and observations reported here conform to the broader, less exclusive 
American definition of schizophrenia. 

All readers should be warned that some of the ambiguities of research findings 
may be directly attributable to these differences in diagnostic practice. Bleuler’s 
suggestion that schizophrenia be treated as a reliable clinical entity has yet to be 
proved successful. 

Following the European diagnostic tradition, phenothiazines tend to be given 
primarily to those indicated by research as most likely to profit from them. Those 
who fail to meet the European definition of schizophrenia but still are psychotic 
often receive antidepressant drugs or lithium but not phenothiazines. Yet in this 
country the distinction is almost always ignored by clinicians. As a consequence 
we are in a golden age of research, which may not continue into the next decade. 
Process and reactive schizophrenics may be compared with little or no fear that 
they differ in kind or level of drugs in their clinical treatment. 

2 Recent schizophrenia high risk studies have made it clear that schizophrenic females with 
offspring have abundant affective symptoms even though they may be designated on other 
grounds as process schizophrenics. 

Recent work by Zigler and his co-workers supports an argument that high vs 
low redundancy is related to level and style of cognitive functioning independently 
of schizophrenia. Zigler was puzzled by the fact that Johannsen et al (35) had 
reported an independence between premorbid adjustment and paranoid symptomatology, 
while Goldstein, Held & Cromwell (28) reported two separate studies in 
which a particular relationship existed. According to the latter, good premorbids 
may be either paranoid or nonparanoid, but poor premorbids almost always 
tended to be nonparanoid. Sanes & Zigler (61) obtained data which supported the 
Johannsen et al interpretation of no relationship. The present reviewer (see 61) 
offered the interpretation that the differences in results may derive from cognitive 
differences in the VA and state hospital populations, the former being somewhat 
higher because of having passed a mental and intelligence examination in order to 
get into military service. Johannsen et al and Sanes & Zigler had studied VA 
populations, and Goldstein et al had studied state hospital populations. 

Zigler & Levine (80), therefore, conducted an investigation in both types of 
hospitals and found, in accordance with past results, that the VA population fit the 
conclusions of Johannsen et al (35) and the state hospital population fit the 
conclusions by Goldstein et al (28). Eisenthal, Harford & Solomon (23) failed to 
find such differences between private and state hospitals but observed that the 
base rate for diagnosing paranoid vs nonparanoid schizophrenia varied greatly 
from hospital to hospital. While 39% to 40% of schizophrenics in their and the 
Johannsen et al data were diagnosed paranoid, 59% in the Sanes & Zigler data and 
only 27% in the Goldstein et al data were diagnosed paranoid. Thus the base rate 
for paranoid status as well as social competence was different among the different 
hospital populations. The resulting effect on the relationship of premorbid 
adjustment and paranoid symptomatology may be viewed with at least two inter- 
pretations: cognitive level and admission evaluation differences. 

The first interpretation has the following logic. Schizophrenics classified as 
paranoid have by definition delusions as their prominent symptoms. Delusions are 
false ideas. To have a false idea requires the capacity to have ideas. Ideas require 
cognitive ability. Cognitive ability is higher in VA and higher socioeconomic 
populations. Therefore, such populations show a greater incidence of paranoid 
symptoms, but on this stratum the relationship between paranoid symptoms and 
premorbid adjustment does not necessarily covary. On the lower stratum of state 
hospital populations, the poor premorbid and nonparanoid characteristics would 
frequently concur. 

The second interpretation has the following logic. In better staffed, private, or 
training hospitals clinicians will more often spend a longer time examining and 
listening to a patient upon his admission. The longer a person (even this author) is 
listened to, the greater the likelihood that he will reveal a false idea. Once he has 
revealed a false idea, the likelihood is greater that a patient will be diagnosed 
paranoid schizophrenic. Thus the staffing structure of the hospital may influence 
the ratio of diagnosed paranoid vs nonparanoid schizophrenics. So the inarticulate 
and unattended poor premorbid patients in an understaffed state hospital are 
more likely to be diagnosed with a nonparanoid classification. 

Finally, Zigler & Levine (80) studied the symptoms of turning away from others, 
against others, or against self. They found that the poor premorbids (those with 
low social competence) tended to turn away from others (consistent with a nonparanoid 
interpretation). Good premorbids (those with high social competence) 
took one of two directions. Either they had symptoms of turning against others 
(consistent with a paranoid interpretation) or against self (consistent with a de- 
pressive or schizoaffective interpretation). This pattern of symptoms presents an 
additional explanation of why Goldstein et al (28) found poor premorbids were 
almost always nonparanoid, while good premorbids tended to be either paranoid 
or nonparanoid. The good premorbid nonparanoids were apparently the ones 
with depressive or other affective components in their disorder. 

The results of Zigler and his colleagues are sobering (disappointing, if you will) 
to the clinical scientist trying to sort out the etiology of schizophrenia. They suggest 
that clinical symptoms (delusions, affect, redundancy level) may be telling us more 
about the cognitive level of the individual than about the mental disorder. Perhaps 
it is not surprising, therefore, that recent research (70) has found clinical symptoms 
of limited value in predicting outcome in schizophrenia. 


Arousal 


Of what relevance is arousal to the redundancy concept, i.e. to the high and low 
rate of processing in schizophrenia? If any relationship exists at all, it is that the 
low rate processors (high redundancy schizophrenics) are higher in arousal. 
However, the findings do not seem to fit together into one neat conceptual picture, 
perhaps because arousal may mean different things when measured differently, 
and because the concept of schizophrenia is elusive. Before moving on to recent 
research which does fit into an exciting conceptual picture, it behooves this author 
to face some of the issues of arousal and schizophrenia and to do so with humility. 

Mednick (47) offered a theory that acute schizophrenics became hyperaroused 
through a cyclic pattern of increasing fear response and increasing stimulus gen- 
eralization. Chronic schizophrenics, on the other hand, were proposed to have low 
levels of arousal, possibly even lower than normals. This formulation accounted 
for overinclusive behavior in acute schizophrenics and a withdrawn quality in 
chronic schizophrenics. 

Venables (72) reviewed literature which proposed acute schizophrenics to have 
a broad focus of attention and chronic schizophrenics a narrow focus of attention. 
These results were interpreted in a direction reverse to Mednick’s formulation 
(47). That is, the acute schizophrenic was interpreted to be low in arousal and 
lacking the power to block out irrelevant peripheral stimuli in order to attend to 
the task. The chronic schizophrenic was interpreted as having high arousal and 
from that the power to channel attention narrowly. 

The research since that time has done some damage to the earlier positions of 
both Mednick (47) and Venables (72). Both have revised their formulations on the 
basis of their own and other data. Most research since their initial formulations has 
supported the notion that chronic schizophrenics are higher in arousal than acutes 
and normals, thus supporting Venables. The possibility that acute schizophrenics 
are lower than normals in arousal is in no way confirmed. Also, the relationship of 
arousal to broad versus narrow attention in the simple sense offered by Venables 
has lacked support. 
The major areas of focus on arousal-related research more recently have concerned 
the GSR responsivity, GSR response recovery time, EEG power density 
variations, EEG alpha blocking latency and blocking characteristics, evoked re- 
sponse, and contingent negative variation. 

When schizophrenics were described in one research study (78) as highly re- 
sponsive in GSR amplitudes and in another study as low in responsivity (10), 
Gruzelier & Venables (31) took note of this discrepancy and proceeded to identify 
groups of schizophrenic “responders” and “non-responders.” The non-responders 
are deficient in two-flash threshold performance as compared to “responders” and 
normal subjects. Heart rate differences were also shown. However, contrary to 
what might have been expected, no findings have indicated a relationship between 
this GSR-responder dichotomy and the major variables related to the redundancy 
classification, i.e. paranoid symptomatology and premorbid adjustment during 
the acute phase of illness. 

Mednick (48), in his study of offspring of process schizophrenic parents, found 
GSR recovery time following a specific response to be the most important auto- 
nomic discrimination between children who later suffered mental breakdowns and 
those who did not. The ones susceptible to breakdown had quicker GSR recovery. 
Following up on this finding, Ax & Bamford (3) found that chronic schizophrenics 
also had faster GSR recovery than normals. Mednick interpreted fast recovery to 
mean that the individual has such a quick avoidance response to a fear-producing 
situation that the avoidance response never has a chance to extinguish. A slow 
recovery to a new baseline, i.e. continued responsiveness to the fear-producing 
stimuli, would allow time for extinction, i.e. for competing responses to replace the 
“fear-then-quick-avoidance” response. 

An alternative, and not entirely contradictory, interpretation to the Mednick 
reinforcement-learning one is that the continued elevation of the GSR prior to 
recovery may be a correlate of the amount of information processing of the 
stimulus event at that instant. If the schizophrenic, presumably in high redun- 
dancy mode, “shuts down” his information processing activity, then he would 
have less registry and less subsequent trace from the stimulus event. He would be 
likely to respond, as Shakow (63) described, to a later presentation of the same 
stimulus as if it were brand new. Consistent with this notion, Edelberg (22) has 
reported fast GSR recovery time to be associated with poor habituation, another 
characteristic of schizophrenic subjects.³ 

3 During a visit with E. Rodnick at UCLA in 1972, I was discussing Mednick’s and Ax’s 
new findings, and my interpretation of what they might mean, when Rodnick pulled from his 
shelf the dissertation of Carl F. Frost (25) which he supervised at Clark University in 1948. It 
contained charts and data analyses demonstrating the faster GSR recoveries of the 
schizophrenic subjects. Thus the findings described above do not represent the first time that the fast 
GSR recovery phenomenon was observed in schizophrenics. Nor was it the first time I have 
been humbled by the vast personal experience and knowledge of people such as Rodnick, 
Shakow, and Zubin. 

Recent research on EEG has fairly consistently been interpreted to indicate 
schizophrenics are high in arousal. Goldstein & Sugermann (26), finding 
schizophrenics to be less variable over time in EEG power density, interpreted 
such patients to have a hyperregulated, hyperaroused system which did not allow 
fluctuations as a function of the incoming stimuli. Cromwell & Held (18) found 
reactive schizophrenics, ordinarily as slow or slower than normals in motor reac- 
tion time, to have a significantly faster alpha blocking latency than normals in 
response to a reaction time stimulus presented during alpha rhythms. Process 
schizophrenics could not be tested because insufficient alpha was shown. These 
findings were also interpreted to indicate high arousal in reactive schizophrenics 
and either higher arousal or a noisy brain in process schizophrenics. The oft-held 
interpretation that schizophrenics are slow in motor reaction time simply because 
they are not motivated to the task is difficult to reconcile with the short alpha 
blocking latency findings. Nideffer et al (52) studied alpha percent time in reactive 
schizophrenics and normals with eyes open and closed. They found the well- 
known differential probability of less alpha with eyes open (and more alpha with 
eyes closed) to be significantly greater in schizophrenics than in normals. They also 
found schizophrenics better in reaction time with eyes closed than open and 
normals better in reaction time with eyes open than closed. These results were 
also interpreted in terms of a greater arousal in reactive schizophrenics as regards 
visual sensitivity. 

Research on visual evoked response (13) has led to the identification of patterns 
of augmenting and reducing in an early component of the averaged wave. Aug- 
menters tend to have increasing amplitude of the wave component with increasing 
light intensity; reducers show less amplitude at high light intensities. While this 
finding appeared to have implications for both arousal and redundancy, Schechter 
& Buchsbaum’s (62) subsequent research has indicated that this effect is greatly 
influenced by instructions and attentional set given to the subject. Meanwhile, 
Asarnow (2) has found the “reducing” tendency to be highly related to digit 
symbol deficit, a well-known psychoneurological soft sign of organic brain damage. 
Also, while schizophrenics tend in general to be greater augmenters than 
normals, those without a family history of schizophrenia tend toward the “reduc- 
ing” tendency more than schizophrenics with a family history of schizophrenia. 

As indicated previously, the conclusions which can be drawn about arousal in 
schizophrenics are tenuous. Some data indicate that schizophrenics, acute and 
chronic, are high in arousal. However, one should be cautious, if not resistant, in 
generalizing from one arousal measure to another, or in assuming arousal is a 
monolithic construct. It would also appear that the measurement of arousal, 
whether singular or plural, is often overridden by other factors. At least one of these 
factors is probably the redundancy level. In other words, both extremely high-rate 
and low-rate processing of external stimulus information occur during high 
arousal in schizophrenics. 


Predicting When Pathology Will or Will Not Occur 


A most beneficial trend in recent research has been away from schizophrenic- 
normal comparisons. An increase has occurred in the number of studies to predict 
when schizophrenics will or will not manifest their pathology. Schizophrenics have 
long been noted for displaying normal behavior most of the time and showing 
their pathological behavior only on rare critical occasions (33). Nideffer & Cromwell 
(51) illustrated this fact by examining choice reaction times of chronic good 
and poor premorbid schizophrenics and normal subjects. Looking into a dark box 
with two pinpoints of light, subjects lifted the left forefinger from a key if the left 
light went out and the right forefinger from a key if the right light went out. Trials 
were plotted as a trade-off function between speed and accuracy. As expected, the 
faster, the less accurate; the slower, the more accurate. Poor premorbids, usually 
slower than good premorbids, were surprisingly faster than the goods and were 
equal to normals, perhaps because all peripheral visual stimulation was removed. 
Normals had an unusually straight line function of accuracy as a function of speed. 
Schizophrenics, however, especially the good premorbid patients, had a trade-off 
function which bent at the slow end as a result of a large group of trials which were 
both very slow and inaccurate. Thus two schizophrenic deficits were described. 
One was a general slowness in reaction time at all levels of accuracy, which was 
shown only by the good premorbids. The other was a “high-redundancy” or 
“out-of-contact” slowness, where schizophrenics were not only slow but were 
inaccurate at a response speed level where accuracy should be optimal. Since the 
latter deficit occurs only on some trials and not on others, an important unan- 
swered question concerns the trial-to-trial determinants which are related to this 
sporadic deficit. 


THE WATERLOO STUDIES What are the conditions which will cause a schizophrenic 
to act schizophrenic, and what conditions will produce essentially normal 
behavior? Steffy, his students, and co-workers at the University of Waterloo have 
perhaps made the greatest recent impact on this question. Their strategy, like that 
of Shakow’s group, has been to study reaction time, the simplest level of response 
in which schizophrenic pathology occurs. 

As a preface for these Waterloo studies, some of the major reaction time 
findings of the Shakow group will be summarized. Schizophrenics were found to 
be slower than normals in simple reaction time. Also, they were found to have 
greater intrasubject and intersubject variability than normals (64). The prepara- 
tory interval (PI; time between warning and imperative stimulus) was found to be 
an important source of variance. When PIs of different lengths are presented in 
regular series (e.g. a series of 3-sec PIs, then 1-sec PIs, then 7-sec PIs), the results 
are different from when they are presented in irregular series (random order of PI 
length from trial to trial). Regularity means a degree of certainty about what PI is 
coming next. Normals tend to utilize the information provided by regularity and 
to perform faster than when PIs are irregular. Many schizophrenics, however, 
show a crossover effect which normals do not show. With trials of short PIs a 
regular series yields faster reaction times than an irregular series. A reversal occurs 
with PIs longer than approximately 5 sec. That is, when the series is regular and the 
PI is long, schizophrenics perform more poorly than if it were an irregular series. 
Why should certainty of information impair schizophrenics? The mystery of this 
phenomenon had been reduced when Zahn, Rosenthal & Shakow (79) identified 
the PPI-PI relationship. The PI on the immediately previous trial (the PPI) was 


608 CROMWELL 


found to have a great influence on the subsequent trial. When the PPI was greater 
than the PI (i.e. when a long PI was followed by a short one), the reaction time 
was slower. When the PPI was less than the PI (a short PI followed by a long 
one), the reaction time on the latter was relatively faster. Schizophrenics were 
found to have this PPI “overdetermination” more than normals. This helped 
explain the crossover phenomenon in schizophrenics. In an irregular series, a long 
PI would have a high probability of being preceded by a shorter PI, and vice versa. 
Thus the curve of reaction time as a function of PI in the irregular series would be 
affected in a direction consistent with the crossover effect. 
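
The probabilistic step in this argument can be made concrete with a short enumeration. The sketch below is only an illustration, not a procedure from the studies reviewed: the PI values are hypothetical, and it assumes that in an irregular series the preceding PI is drawn uniformly and independently from the same set. Under those assumptions, the chance that the PPI is shorter than the PI rises steadily with PI length, while the chance that the PPI is longer falls.

```python
from fractions import Fraction

# Hypothetical preparatory intervals in seconds (assumed for illustration only).
PI_VALUES = [1, 2, 4, 7, 15]


def ppi_relation_probabilities(pi_values):
    """For each PI, return (P(PPI < PI), P(PPI = PI), P(PPI > PI)).

    Assumes the preceding PI is drawn uniformly and independently
    from pi_values, as in an idealized irregular series.
    """
    n = len(pi_values)
    table = {}
    for pi in pi_values:
        shorter = sum(1 for ppi in pi_values if ppi < pi)
        equal = sum(1 for ppi in pi_values if ppi == pi)
        longer = n - shorter - equal
        table[pi] = (Fraction(shorter, n), Fraction(equal, n), Fraction(longer, n))
    return table


if __name__ == "__main__":
    for pi, (p_lt, p_eq, p_gt) in ppi_relation_probabilities(PI_VALUES).items():
        print(f"PI={pi:>2} sec  P(PPI<PI)={p_lt}  P(PPI=PI)={p_eq}  P(PPI>PI)={p_gt}")
```

With any such set, the longest PI is almost always preceded by a shorter one (the favorable PPI<PI case) and the shortest PI by a longer one (the unfavorable PPI>PI case), which is the direction the crossover argument requires.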

The “catcher,” however, was this: The role of the PPI in accounting for the 
crossover effect was marred by one additional finding. While PPI<PI gave better 
performance than PPI>PI, the PPI = PI combination was shown to yield the best 
performance of all (54). This fact should override the PPI-PI inequality effect and 
still give the advantage to a regular series at all levels of schizophrenic reaction 
time performance. Why do the crossover data directly contradict this expectation? 
Steffy was the first to see the importance of this question. 

To grapple with the question, Bellissimo & Steffy (9) introduced a series of four 
regular PI trials within a long irregular series. A striking thing happened when this 
procedure was conducted with process schizophrenics. In the interpolated regular 
trials, relatively independent of length of PI, the reaction time at first improved 
and then became progressively worse. The immediate improvement was what had 
been observed in the PPI = PI effect (53). The subsequent deterioration had never 
before been identified. With short PIs the immediate improvement was more 
pronounced. Thus two different contradictory factors seemed to be operating: an 
immediate facilitation from the certainty information, then a deterioration resulting 
from repetition, all within a four-trial span. Bellissimo & Steffy interpreted 
this progressive deterioration from trial to trial as a redundancy-related deficit. It 
was as if an event had occurred (the discovery of regularity after a prolonged 
irregular series) which caused the window shades to come down on the attentional 
focus of the process schizophrenic so that the outside world would be less and less 
adequately responded to. Since schizophrenics are usually unable to describe any 
change in the PIs in the Bellissimo-Steffy paradigm, it would appear that change of 
response takes place on a level of information processing which is not always 
linked to verbal awareness. Alas, this finding also accounted for the existence and 
extent of the mysterious and paradoxical crossover effect. 
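
The structure of the interpolated paradigm, as described above, can be sketched as a trial list: a run of irregular PIs, a block of four identical PIs, then more irregular PIs. Everything specific here is an assumption made for illustration (the function name, the PI values, the run lengths, and the use of uniform random sampling for the irregular portion); the source states only that four regular trials were embedded in a long irregular series.

```python
import random


def build_interpolated_series(irregular_pis, regular_pi, n_before, n_after,
                              n_regular=4, seed=0):
    """Return (series, start) for a hypothetical interpolated PI series.

    series: list of PIs, irregular trials surrounding one regular block.
    start:  index of the first trial of the regular block.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    before = [rng.choice(irregular_pis) for _ in range(n_before)]
    after = [rng.choice(irregular_pis) for _ in range(n_after)]
    block = [regular_pi] * n_regular  # the interpolated regular trials
    return before + block + after, n_before


if __name__ == "__main__":
    series, start = build_interpolated_series([1, 3, 7], regular_pi=3,
                                              n_before=10, n_after=5)
    print("regular block:", series[start:start + 4])
    print("full series:  ", series)
```

Reaction times would then be compared across the four positions of the regular block, which is where the initial improvement and the later deterioration were reported.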

The crossover and redundancy-related deficit were then subjected by Steffy’s 
group to several systematic investigations. The crossover had been shown in the 
original study (9) not to occur in reactive schizophrenics or normals. How durable 
was it, however, in process schizophrenics? As a first step, Steffy⁴ conducted a 
study in which schizophrenic reaction time was lowered by a combination of 
preliminary training and positive urging and support from the examiner. Dra- 
matically, another important finding emerged. While schizophrenic reaction time 
went down into normal limits through the positive urging, the crossover remained. 


4 Unpublished research. 
Thus the crossover effect, like anticipatory responses with long PIs (53), is a more
enduring and distinctive descriptor of process schizophrenia than is slow reaction 
time. 

The durability of the crossover effect was then tested in a different way. Bellis- 
simo & Steffy (8) examined different PPI-PI relationships in the interpolated 
paradigm. Their purpose was to seek some condition where the paradoxical effect 
might disappear, but they failed to do so. In one condition, where the antecedent 
irregular trials had a range of short PIs and the interpolated regular set were longer 
in PI length, the latter regular series produced slower reaction times at all PI levels 
tested. In other words, uncertainty gave better performance in schizophrenics in 
short as well as long regular PIs. 

Finally, Kaplan (37) found a way to “cure” the crossover in process schizo- 
phrenics. After a pretesting to determine the presence and extent of the crossover, 
one group of subjects was given training on length-of-line judgments, then on 
length-of-PI judgments. After subjects reached a criterion in their training on the 
temporal discriminations, they were then posttested. The crossover disappeared.
In a control group of schizophrenics where the training involved recognizing 
shapes, the crossover remained. After these initial findings, Kaplan extended his 
investigation by comparing four groups of process schizophrenics in obstruction of 
the crossover effect. Essentially the findings confirmed and extended the previous 
study in showing that the pretraining (i.e. the increasing of certainty in PI infor- 
mation) could obstruct the crossover effect. In addition, instructions that PI was 
going to be “same” or “different” during posttest trials contributed additively to the
crossover obstruction. The remarkable observations, however, were first that three 
of the four groups showed return of the crossover in immediate posttesting. 
Second, when the group with the most powerful anticrossover interventions was 
retested a month later, the crossover returned in three of eight subjects. Both the 
lasting effects of the obstructions in some and the resilience of crossover in others 
were demonstrated.

Along with testing the durability of the crossover effect, another study was 
conducted which magnified it. If the crossover results from interference with the 
maintenance of set, then it should become greater with longer intervals between 
trials. Longer intervals would allow more opportunity for interference to take 
place. On the other hand, if the crossover phenomenon results from an accumu-
lated inhibitory factor which dissipates with the passage of time (e.g. defensive
or reactive inhibition), then the crossover should become greater with shorter 
intertrial intervals. Steffy & Galbraith (67) examined crossover in process
schizophrenics with 2 and 7 sec intertrial intervals. The crossover was distinctly 
greater with 2 sec between trials. No difference in mean reaction time accompan- 
ied the difference in amount of crossover. These findings suggested that the 
phenomenon being studied could not be explained by set factors. 

If a stimulus condition so mild as a change in PI format could precipitate a 
momentary high redundancy condition, as shown by Bellissimo & Steffy (9), could 
other mild but novel stimuli do the same thing? And if they could, what time is 
required for the process schizophrenic to recover? Steffy & Galbraith (68) ad-
dressed themselves to these questions by introducing an unexpected border of Xs
around the visual ready signal on a display scope and flashing a brief unexpected 
dim light in the dark testing room when a visual warning signal initiated the PI. 
Trials with this gentle intrusion were compared with those of the same PI length 
which had no such intrusion. Normals showed no effect from the intrusion. 
Reactive schizophrenics showed a slowing when PI was only 1 sec. When the PI 
was longer, complete recovery occurred. Within the context of these two compar- 
ison groups an extraordinary effect was again shown in the process schizophren- 
ics. For them, the intrusion brought about a slowing which followed a specific 
pattern throughout the 1 to 9 sec range of PIs. As might be expected, the deficit was 
greatest after a PI of only 1 sec, and then it became less. At PIs of 5 and 7 sec, the 
deficit was minimal. Then at 9 sec the performance was again debilitated. This 
shifting of the debilitation, being greatest at 1 and 3 sec, recovering at 5 and 7 sec, 
and recurring at 9 sec, was correlated with the Elgin scale measure (76) of degree 
(severity) of the process disorder. At the 1 sec PI, the more the process score, the 
slower the reaction time. As the PI delay became 5 and 7 sec, the correlation 
became negative; the more the process, the shorter the reaction time. Then with a 
PI of 9 sec, the correlation was back in the positive direction. The findings just 
described were then replicated in a second sample of 16 process schizophrenic 
subjects.⁵ Thus some confidence may be attached to these unique findings.

For the first time in history the length of the deficit following mild intrusive
stimulation was traced. While the deficit would not be expected over minutes or 
hours, a momentary deficit in the order of seconds does exist. Is the momentary 
deficit cyclic in nature? Or does the process schizophrenic overrespond to the 
distraction in the immediate short PI range and then, in an attempt to dampen his 
responsivity, does he “over-damp?” That is, after reaching a state of optimal 
performance, does he then become underresponsive? If so, the early deficit ob- 
served would result from overresponsivity to distracting stimuli, and the late 
deficit would result from underresponsivity to relevant stimuli. Whatever the 
explanation, the greater the Elgin process score, the greater the phasic shift. 

While the “mild intrusion” study just described was being done, Green (30) 
became interested in the capacity of schizophrenics to handle stimuli at higher 
intensity levels. Rosenbaum, Mackavey & Grisell (58) had shown schizophrenics 
to improve in reaction time with high intensity stimulation, yet Steffy & Galbraith 
(68) were finding the disabling effect. Green examined the effects of (a) pulses of 
loud and soft tones delivered just prior to the reaction time task and (b) different 
intensities of imperative stimulus tone. In accord with the more subtle stimulation 
given by Steffy & Galbraith, Green found that the preliminary irrelevant stimu- 
lation of greater loudness produced a slowing of reaction time. However, in line 
with Rosenbaum et al, increased intensity of the imperative stimulus tone shortened
the reaction time. Again, as in the other Waterloo studies reported, the effect
occurred only in process schizophrenics. Repeated testing from one-half to one
day later revealed no trace of the effects of the preliminary disrupting stimulation.
In fact, recovery was beginning to occur from the prestimulation by the time 25
reaction time trials had been administered. Green repeated the experiment with
bright versus dim light flashes for the preliminary disrupting stimulation, and he
obtained the same results as with the tones.


5 One of the lessons learned by recent schizophrenia researchers is that the use of small Ns
with replication is a wisely chosen practice. Careful subject classification is so important and the
availability of subjects within any subclass is so limited that the attempt to collect a large N
will often either strain the criteria for careful subject selection or the resources of research
subjects available. Replication becomes important to compensate for the small sample size.


THE HEART RATE STUDIES The heart rate interpretation by Lacey (42) suggests 
itself as relevant to the redundancy formulation in schizophrenia. Heart rate 
deceleration, being associated with orientation, possibly could be viewed as low 
redundancy (high-rate processing). Heart rate acceleration, being associated with 
either fear or tuning out of external stimuli, could be associated with high redun- 
dancy (low rate processing). If so, heart rate should have some relationship to 
the redundancy-related deficits in schizophrenia. 

Bradley (12), another student in Steffy’s Waterloo group, examined heart rate 
changes occurring in process schizophrenics and normals during the preparatory 
interval in a reaction time task. A PI of 6 sec in a regular series was compared with 
a PI of the same length in an irregular series. Normal subjects, as reported by 
Obrist and co-workers (54), show a deceleration in heart rate during the PI. 
Bradley found this deceleration to occur in normals for both the regular and 
irregular series. In process schizophrenics a striking phenomenon occurred. In the 
irregular series, a detectable deceleration was apparent throughout the 6 sec. In 
the regular series, however, the heart rate oscillated in a marked acceleration- 
deceleration pattern. As expected, in normals the deceleration and the regular 
series were associated with faster reaction times. The irregular series were associ- 
ated with slower reaction times. For the process schizophrenics the heart rate de- 
celeration and irregular series were associated with faster reaction times, and the 
acceleration-deceleration pattern and regular series were associated with slower 
reaction times. 

These results support the notion that heart rate indeed is a mediating or corre- 
lating factor of stimulus redundancy. More important, the results suggest the 
possibility that the moment-to-moment changes in redundancy-related deficit 
may be predicted on the basis of heart rate. Why heart rate? No one knows. 
Independent of the studies of regular versus irregular series, schizophrenics have 
been shown to become slower and normals faster in reaction time when they are 
given relevant information about the task (e.g. 19). One possible interpretation, 
suggested also by Kaplan (37), is that process schizophrenics feel distressed when 
they are called upon to respond adequately, and increased information only 
heightens the feeling of being “on the carpet.” Heart rate may therefore accelerate 
momentarily because of the fear of having to deal with relevant information. 
Heart rate deceleration may result from bodily processes slowing down so that 
biologic conditions are optimal for processing information. If this interpretation is 
correct, a change in redundancy level could change heart rate, but a change in 
heart rate would not necessarily change redundancy level. 
Warner (73), in a master's thesis supervised by Broekema at the University of
Wisconsin, studied heart rate acceleration and deceleration during the presenta- 
tion of a standard stimulus in a size estimation task. Acute good premorbid 
paranoid schizophrenics, poor premorbid nonparanoid schizophrenics, and alco- 
holic control subjects were included in the study. The study confirmed earlier 
results (20, 50) that size overestimation occurred in the poor premorbid nonpara- 
noids and underestimation occurred in the good premorbid paranoids. As ex- 
pected, heart rate deceleration during the presentation of the standard stimulus 
was associated with size underestimation in schizophrenics. However, the same 
degree of deceleration was associated with veridical estimation in the alcoholics. 
While the alcoholics showed consistent cardiac deceleration, the decelerative 
response was very small in the schizophrenic group. A greater correlation was 
obtained between heart rate acceleration and size overestimation in the schizo- 
phrenics. Again, the acceleration present in the alcoholics’ heart beats was not 
associated with size overestimation. 

McCormick (46), in a doctoral dissertation supervised by Broekema, studied the 
same variables as Warner (73) as a part of a larger study. In this study even in the 
alcoholics the size overestimation responses were associated with cardiac acceler- 
ation and size underestimation responses were associated with cardiac decelera- 
tion. Groupwise, the size estimation results were consistent with previous findings: 
paranoids underestimated; nonparanoids overestimated; and alcoholics averaged 
out to no difference from veridicality. However, the decelerative response was this 
time more pronounced in the schizophrenics. Greater cardiac deceleration in the 
paranoids was associated with size underestimation, as expected. A new finding 
was that greater cardiac deceleration in the nonparanoids was associated with size 
overestimation. This finding, as compared to the Warner data, places new interest 
on the subject and stimulus parameter differences which occurred between the 
Warner and McCormick studies. Warner tested schizophrenics on phenothiazine 
drugs; McCormick tested schizophrenics immediately on admission who were 
verified to be on no drugs. Warner examined heart rate during 10-sec exposure 
times of the standard size estimation stimuli; McCormick examined heart rate 
during only a 6-sec standard stimulus presentation. These differences in param- 
eters leave the reader with some provocative hypotheses: (a) phenothiazines 
may influence the accelerative and decelerative response limbs of the heart rate 
orienting response, especially in process schizophrenics; (b) the cardiac accelera- 
tion, found by Warner to be associated with size overestimation in poor premorbid 
nonparanoids, may occur primarily after the 6-sec standard stimulus presentation 
as given by McCormick. If so, this would conform to the interpretation of the 
Steffy & Galbraith results (68) that an optimal orienting period around 5 sec is 
followed by “damping” around 9 sec following stimulus onset. 

Since 1964, the subclassification approach to schizophrenia has been the most 
useful tool available to control and account for the great variance in schizophrenic 
behavior. With the constitution of extreme groups in the Warner (73) study, this 
proposition still holds true. That is, the subject classification (reactive paranoid, 
process nonparanoid) accounts for more of the variance in size estimation than
does heart rate acceleration and much more than heart rate deceleration. How- 
ever, perhaps the door is now being opened to an era where the prediction of 
schizophrenic pathology (and nonpathology) can soon be done better by the 
monitoring of concurrent organismic variables than by the use of subtypes to 
predict group differences. 


SCHIZOPHRENIC LANGUAGE As mentioned earlier, the skeptic may still ask: 
“What does all of this have to do with schizophrenia? You can fiddle around with 
these minuscule variables for ages, but they may bear no relationship or use in what
we recognize as the schizophrenia in schizophrenics.” Stated differently, schizo- 
phrenics do not get arrested or hospitalized for size estimation error or heart rate 
change any more than they do for slow reaction time and crossover. Indeed, no 
evidence yet exists that these variables have anything to do with etiology or 
intervention in schizophrenia. A recent study of schizophrenic language, however, 
suggests that some rapprochement of the laboratory and clinical aspects may soon 
be possible. 

Cohen, Nachmani & Rosenberg (14) have reported a study in which the prob- 
ability of schizophrenic language was influenced through the manipulation of 
briefly presented stimuli. The study was stimulated by a self-editing theory of 
schizophrenic language (59). According to this theory, people select appropriate 
words for their referential communication by examining the element to which 
they are referring, developing an association to that element from their own 
response repertory, and comparing the association to irrelevant elements. If the 
association is appropriate to relevant and not to irrelevant elements, it is emitted as 
part of the communication; if the association is inappropriate to distinguish the 
relevant from irrelevant elements, the association is rejected. In the latter case, a 
new association of lower dominance is called forth, and the cycle is repeated. The 
theory holds that the difficulty in schizophrenic language is in the inability to reject 
an inappropriate association, once elicited, in order to recycle and search for a 
more appropriate one. 
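
The editing cycle just described can be pictured as a short loop. The following is only an illustrative sketch, not a procedure taken from the self-editing theory's authors: the function name `self_edit`, the `fits` predicate, the `can_reject` flag, and the two-disc labels are all hypothetical, and the sketch does not attempt to model the chaining of further associations.

```python
from typing import Callable, Optional, Sequence


def self_edit(
    associations: Sequence[str],
    fits: Callable[[str, str], bool],
    target: str,
    distractors: Sequence[str],
    can_reject: bool = True,
) -> Optional[str]:
    """Return the description emitted for `target`.

    `associations` is ordered from most to least dominant (an assumption
    of this sketch). `can_reject=False` stands in for the hypothesized
    inability to reject an inappropriate association once elicited.
    """
    for word in associations:
        # Appropriate = applies to the relevant element and to no irrelevant one.
        appropriate = fits(word, target) and not any(
            fits(word, d) for d in distractors
        )
        if appropriate:
            return word  # emitted as part of the communication
        if not can_reject:
            return word  # editing fails: the inappropriate association is emitted
        # Otherwise reject it and recycle to the next, less dominant association.
    return None  # repertory exhausted without a discriminating association


# Hypothetical display: both discs are "salmon"; only disc A is "the pinker one".
labels = {"A": {"salmon", "the pinker one"}, "B": {"salmon"}}
fits = lambda word, disc: word in labels[disc]
repertory = ["salmon", "the pinker one"]

edited = self_edit(repertory, fits, "A", ["B"])
unedited = self_edit(repertory, fits, "A", ["B"], can_reject=False)
print(edited, "|", unedited)
```

With editing intact, the dominant but nondiscriminating word is rejected and the second association is emitted; with rejection disabled, the dominant word is emitted even though it fails to distinguish the target.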

Cohen et al (14) used a referential discrimination paradigm with color discs to 
examine the self-editing formulation. Two or more color discs were shown, one 
designated as the target. The subjects were instructed to read into a tape recorder 
a description of the designated disc so that a listener could pick it out. When the 
hues on the color discs are widely separated, the dominant association usually 
discriminates adequately and minimal self-editing is necessary. Under these con- 
ditions normal and nonparanoid patient responses were found to be indistin- 
guishable in communication accuracy, reaction time, and utterance length. For 
example, when a green and a blue disc were shown and the blue disc was the 
target, both normals and schizophrenics gave responses such as “blue,” “the 
bluer,” “the blue one,” etc. 

When the color distinction was more difficult, such as with two very close shades 
of pinkish salmon, then a situation was created where the most dominant asso- 
ciation, such as “pink” or “salmon,” was inappropriate. It had to be edited out in 
order to allow a “second take.” Although both groups increased the number of
spoken words during a difficult discrimination, normals showed evidence of hav- 
ing rejected their initial associations, made finer distinctions, and then emitted an 
appropriate association. For example: 


“My God, this is hard. They are both about the same except this one might be a little 
redder.” 


“Both are salmon-colored. This one, however, has more pink.” 


By contrast, the schizophrenics, after making the initial dominant association (and 
possibly discovering the difficulty of the situation), were generally unable to reject 
it, go back to process more information, and select an appropriate discriminative 
association. For example: 


“A fish swims. You call it a salmon. You cook it. You put it in a can. You open a can. 
You look at it in this color. Salmon fish.” 


“This is the color of a shit-ass bowl of salmon. Mix it with mayonnaise. Then it gets 
tastey. Leave it alone and puke all over the fuckin’ place. Puke fish.” 


Cohen et al (14) interpreted these results as supporting a perseverative-chaining 
model of schizophrenic language. Emphasis was put on inability to disattend (17) 
from immediate (60) stimuli. The relevance of these results to the redundancy 
hypothesis may also be interpreted. Once the schizophrenic fails to edit, he not 
only begins perseverative chaining, but he also ceases to process the relevant 
discriminative information. It is as if once he discovers his dominant association
does not meet the task demand, he “pulls the window shades down” on the outside 
world and starts associating to his own associations. As might be expected from the 
redundancy formulation, Cohen et al reported that these findings might not be
obtainable from paranoid (low redundancy) schizophrenics. One cannot help but 
wonder whether these circumstances which produce schizophrenic language are 
related to the size estimation, reaction time, illusion, and heart rate findings 
previously discussed. 


Conclusion 


In this chapter a number of schizophrenia-related deficits have been described. 
Each one followed what Kraepelin referred to as a “briefly presented stimulus.” 
While the time relationships differ, it would appear that schizophrenic deficit may 
well be studied and better understood during the few seconds and fractions of 
seconds which follow carefully constructed novel, ambiguous, decision-demand- 
ing, or otherwise difficult stimuli. Reaction time, size estimation, illusion, GSR, 
heart rate, and language are some of the behaviors affected. 

Various milestones in the history of assessment of schizophrenia, some benefi- 
cial and some not, have been described briefly. The past decade has shown how 
effective but fairly crude subclassifications of schizophrenia have added to our 
understanding. A new era of schizophrenia research may be starting. We may now 
begin identifying mechanisms by which behavior from one instance to another in
the same patient, as well as differences between patients, may be predicted. 

Perhaps the great tragedy in the study of schizophrenia in the past eight decades 
has been that the interview to study clinically manifest symptoms has been the 
major tool. In this chapter we have seen how these manifest symptoms are in- 
fluenced and obscured by the cognitive style, social competence, and preexisting 
personality of the individual. We have seen that premorbid history rather than 
clinically manifest symptoms is more relevant for predicting outcome of hospital- 
ization and drug treatment. Advances in understanding etiology may also require 
that we turn away from our preoccupation with these clinically manifest symp- 
toms, from our preoccupation with the necessity of interview and projective 
techniques. The observation and concern with coughing is clinically relevant in 
deciding that a person needs relief from respiratory misery; however, the classifi- 
cation and factor analysis of coughs does not permit an understanding of the 
etiology of a respiratory infection, its definitive treatment, or its prognosis. While 
schizophrenia is unlikely to be explained by this model, the disorder has indeed 
been susceptible to such a misguided orientation of study. The laboratory mea- 
sures of reaction time and other factors on a more molecular level of behavior 
have often been dismissed as not clinically relevant. The time has come to accept 
the fact that what is clinically relevant may not be etiologically relevant. 

The clinically manifest symptoms will remain relevant and important in judging 
whether an individual will disturb other people with his madness, will be able to 
cope, will adapt to his community supports. The more subtle manifestations of the 
illness, such as have been reviewed here, may be what is crucial for purposes of 
differential treatment, prognosis, and etiology of the disorder. 

Once schizophrenia is more completely assessed in these subtle deficits, then the 
concept may change. Rather than Bleuler’s notion of a clinical entity, some people 
having it and some not, we may shift our concerns to specific mechanisms which 
are diffusely present to some degree among the well functioning segment of the 
population. These mechanisms, which may indicate risk for showing the more 
socially disturbing features, may be at some degree of disorder from time to time 
in all of us. 
FOREWORD 
Joseph Zubin 


This review consists not of the contribution of a single author, but of several of the 
staff members of the Biometrics Research Unit (BRU) of the New York State 
Department of Mental Hygiene. Since we believe that such fields as abnormal and 
clinical psychology can no longer be reviewed adequately by one worker, this 
review may mark a turning point in future reviews of similar fields. The BRU was 
established in 1956 with the mandate to provide an objective assessment of the 
behavior of mentally disordered individuals with the view of improving diagnosis, 
prognosis, and evaluation of outcome of mental disorders, as well as promoting 
investigations into etiology and prevention. This chapter reviews some of the 
accomplishments of this endeavor on the part of the BRU staff, as well as by 
similar staffs and research workers elsewhere. 

This review has been limited to the following sections: 

1. Behavioral Analysis—Kurt Salzinger 

2. The Measurement and Classification of Depression—Joseph L. Fleiss 

3. Psychiatric Epidemiology—Barry Gurland 
4. Attempts to Improve Psychiatric Diagnosis—Robert L. Spitzer and Jean Endicott

5. A Psychophysiological Approach to Psychopathology—Samuel Sutton

My task is to introduce this survey and then to conclude it with an integration
summary.


Introduction 


The field of abnormal and clinical psychology, which at one time was a relatively 
homogeneous entity, has separated into two distinct and only slightly overlapping 
disciplines, with the clinical wing far surpassing the abnormal in number of 
adherents and applications. Abnormal psychology can still be regarded as the 
basic experimentally oriented part of the endeavor, while clinical psychology may 
be viewed as the applied field largely invested in treatment and application (292). 
The apparent impotence of the standard clinical tests in prognosis (280, 295), their 
low reliability, their lack of value as independent instruments in diagnosis, and 
their dependence for validity on the criterion provided by the clinical interview, 
have forced BRU into abandoning the use of routine clinical tests in research on
diagnosis and prognosis and evaluation of treatment. Instead we turned our
attention to the clinical interview itself—our ultimate criterion of psychopatholo-
gy—and developed a series of systematic structured interviews which have proved
to be reliable and valid (247, 252, 253, 255, 256). The interview techniques have
served as a basis for making cross-national comparisons, as described in the 
section by Gurland; as a basis for computer diagnosis, as described in the section 
by Spitzer & Endicott; as a basis for developing a newer anatomy of psychopath- 
ology in the form of dimensions of psychopathology, as described in the section by 
Fleiss. 

With one leg up on the problem of diagnosis, we then turned our attention to the 
problem of more direct examination of the behavior of the patient. Following in 
the footsteps of general learning theory, Salzinger in his section applies the 
behavioral analysis approach pioneered by Kanfer and Saslow to the salient 
features of psychopathological behavior exhibited by the patient. Sutton, on the 
other hand, makes a more atomistic analysis of the laboratory-elicited behavior of 
the patient in his physiological, sensory, perceptual, psychomotor, and conceptual 
responses. To elicit these responses he applied the framework of the Men- 
deleev-like table of Burdock, Sutton & Zubin (19), in which the following types 
of stimuli are applied: idling state (when no stimulus is applied); energy stimulus 
(when some relationship exists between intensity and response, e.g. shorter latency 
for more intense stimuli); and signal stimuli (in which the response bears little 
relationship to the stimulus intensity but is related to the signal value of the 
stimulus based on either prior experience or wired-in propensities). In his section 
he develops the rationale for utilizing the paradigm of the Mendeleev-like table, 
offers certain research strategies to overcome the “orneriness” of mental patients, 
and demonstrates their use in differential diagnosis.

Up to this point we have dealt mostly with the description of abnormal behav- 
ior. Description, however, is not enough, for it never cured a disorder, nor has it 
of its own accord yielded any insight into the nature of the disorder. To go deeper
we must appeal to etiology, even though our knowledge of etiology is skimpy, if 
not entirely lacking. To make up for our ignorance we must resort to “as if” causes 
in the form of scientific models. However, mere postulation of models is also not 
enough; we must proceed further to draw from them inferred hypotheses and test 
them in the crucible of observation and experimentation. This part of our endeav- 
or is still in its preliminary stages, and we shall deal with it in the summary at the 
end of this chapter. 


1. BEHAVIORAL ANALYSIS


Kurt Salzinger¹


Spurred on by the fact that psychiatric diagnosis continues to suffer the pains of 
low reliability and validity, behaviorally oriented therapists have started to codify 
their own kind of behavioral analysis. This analysis has been applied to both kinds 
of behavior therapies: that based on the classical or respondent conditioning 
paradigm and that based on the instrumental or operant conditioning paradigm. 
This is not to say that in practice one kind of conditioning occurs without the other. 
On the contrary, the two always co-occur (212). What differentiates the therapies is 
the focus of the therapist. It determines the kind of behavioral analysis engaged in 
and, therefore, also the kind of assessment procedure used. 

The main business of the therapist following the respondent paradigm is the 
identification of the stimuli that act as conditional stimuli (CS) for the patient’s 
current state. When using desensitization, the therapist must, according to Wolpe’s 
(286) original formulation, also establish a hierarchy of aversiveness of stimuli in 
order to be able to present the CSs in an increasing order of anxiety productive- 
ness. 

The operant conditioner, on the other hand, has the task of identifying the
discriminative stimuli (Sᴰ) in the reinforcement contingency (Sᴰ ... R → Sᴿ) that
precede the undesirable or desirable behavior (R), and those reinforcing stimuli
(primary Sᴿs and conditioned Sʳs) that follow them; in addition, the operant
conditioner must estimate the strength of those behaviors examined within the
reinforcement contingency.


Distinctive Features of Behavioral Analysis 


Behavioral analysis focuses on those aspects of the individual and his environment 
that can be directly used in therapy. Respondent conditioning emphasizes the 
autonomic responses, while the operant paradigm emphasizes the behavior of the 
patient that directly engages the external environment. Behavioral analysis is 
functional, not topographical. More important than the presence of a fear are the 
conditions (the particular CSs) that elicit the fear (respondent conditioning). More 
important than the individual’s fighting behavior are the conditions under which it
takes place, namely when, where, and to what effect (operant conditioning). 
Behavioral analysis classifies behaviors rather than people. This is not to say, 
however, that behavior therapists restrict their work to the analysis of single 
behaviors. They take into account all relevant behaviors. For example, if such 
therapists discover that a particular individual is currently reinforced for 
undesirable behavior, they determine in addition what reasonable behavior the 
individual has in his repertory; then, if it is thought advisable, that person can be 
retrained to emit the reasonable behavior to obtain the positive reinforcers earlier 
obtained by undesirable behavior. 

Finally, while diagnosis is supposed to shed light on the cause of disorders 
(actually it rarely does so for psychiatric disorders), the relationship of behavioral 
analysis to the cause of the disorder is relatively unimportant. More significant 
than how the behavior started is what is maintaining it. In some cases this amounts 
to knowing the cause, but not usually. On the other hand, behavior analysts do 
assume that most types of behavioral abnormalities are produced through one of 
the learning paradigms. Even though some particular physical insult may be 
vitally involved in a particular abnormality, the interaction of that physical insult 
with the conditioning paradigms is held responsible for the current abnormal 
behavior (see 213 for this kind of explanation of the immediacy hypothesis and 
schizophrenic behavior). 


The Behavior-Analytic View of Testing 


Goldfried & Kent (76) and Mischel (159) explain that test responses are regarded 
as signs of behavior by the traditional approach and as samples of behavior by the 
behavior theory approach. Basically, behavior analysts prefer direct observation 
of the individuals in the actual situation, but they accept the creation of an 
experimental analog situation, including role playing and the use of self-report 
techniques. In the better assessment programs, the patient is taught how to observe 
his or her own behavior in an objective manner; unfortunately, more often than 
should be true, self-report is obtained through the clinical interview, which is beset 
by the artifacts of influence on the part of the interviewer (217-219). 

The traditional approach to testing also carries along with it the assumption that 
the test responses reveal behavior patterns invariant with respect to the situations 
in which they occur. However, as long ago as 1928, Hartshorne & May (92) showed 
that the behavior of individuals depends in large part on the situation (the rein- 
forcement contingency). Behavior is flexible in that it varies with environmental 
stimuli. Only the most extreme abnormality would produce the kind of invariance 
of behavior that traditional tests demand for their validity. 

The use of self-report is often found to be a poor substitute for behavioral 
observation. One study (88, 216) focused on the relationship between communicability 
(behaviorally defined) and amount of contact in a group of people. 
Objectively observed amount of contact was the best predictor of commun- 
icability; self-reported contact was predictive only when considering presence 
vs absence; and self-estimated degree of closeness was not predictive to a significant 
degree. If these are the findings with respect to amount of social contact, 
how much greater must be the problem of estimating behaviors more vaguely 
specified, such as anxiety? Tests used to obtain self-report need to be behavioral, 
specific, and close in time to the occurrence of the behavior in question. 


Behavioral Analysis and Environmental Psychology 


Although the reinforcement contingency deals directly with the environment and 
its effect on behavior (see 220 for a review of this subject with respect to the 
measurement of abnormal behavior), a separate field of environmental psychol- 
ogy has sprung up in recent years. Its discoveries might well contribute to a 
behavioral analysis with its methods of objectively describing the “personality” of 
environments (97, 161, 162). The environment has been described by such di- 
mensions as ecology, organization structure, personality of milieu inhabitants, 
behavior settings, reinforcement properties of the environment, and psychosocial 
characteristics. Although only one of the dimensions is conceptualized in terms of 
behavior theory, behavior theory provides the mechanism through which the 
dimensions no doubt influence people’s behavior. The more important point here, 
however, is that the behavior analyst trying to assess the viability of programs of 
reinforcement or extinction should obtain accurate knowledge about the envi- 
ronment in which the patient resides. 


Behavioral Analysis of Traditionally Categorized Disorders 


Dollard & Miller (47) provided an early extensive treatment of neurosis in terms of 
behavior theory. Their analysis, unfortunately, accepted as fact too many of the 
concepts of psychoanalytic theory. More recently, Salzinger (214) showed how a 
number of learning paradigms could produce known entities of abnormal behav- 
ior such as delusions, depression, and psychophysiological reactions. Because of 
the recent increase in interest in the depression syndrome, it might be useful to 
note two new behavioral analyses of depression (55, 160). 


Methods of Behavioral Analysis Related to 
the Classical Conditioning Model 


The basic approach to assessment for Wolpe’s behavior therapy and some of its 
variants consists of the classical psychological procedures, namely the paper- 
and-pencil test. Wolpe (286) led the way by using the Willoughby Personality 
Schedule, a test for neuroticism or unadaptive anxiety reactions. The patient 
responds to such questions as, “Do you get stage fright?” Other questions deal with 
daydreams, being easily discouraged, feeling miserable, feeling inferior, and being 
unable to make up one’s mind. In addition, Wolpe (287) uses the Bernreuter (9) 
Self-sufficiency Questionnaire primarily to assess the degree to which the patient is 
able to carry out instructions with regard to self-assertion. He also suggested the 
use of the Maudsley Personality Inventory. Finally, Wolpe (287) uses the Fear 
Survey Schedule, which is quite popular in respondent conditioning behavior 
therapy. This test (288) lists situations that might provoke fear in the patient. The 
patient indicates degree of disturbance on a five point scale, ranging from “not at 
all” to “very much.” The 1969 version lists 87 situations, including noise of 
vacuum cleaners, falling, sirens, failure, sick people, receiving an injection, birds, 
lightning, doctors, people with deformities, dirt, and a lull in conversation. 

The fear-survey schedules used by Wolpe and others have been fully reviewed 
by Hersen (94) and by Wolff & Merrens (285). Tasto & Hickson (268) standardized 
and scaled a 122-item Fear Survey Schedule (FSS). They administered it to a 
group of 132 male and 107 female undergraduate students at a large university. 
Standardization of a test presents the therapist with an objective criterion for 
determining when the fear has been reduced to the level that characterizes most of 
the population. Standardization on a college population, however, is questionable 
in terms of how representative it is of the population at large. In a later paper, 
Tasto, Hickson & Rubin (269) performed a profile analysis for the five factors 
earlier derived by Rubin et al (210). These factors and their corroboration by 
profile analysis give some empirical basis for gauging the generalization to be 
expected from the treatment of one problem to that of another. In 1972, Tasto & 
Suinn (270) administered the same test twice within a 10-week period and found 
generally high correlations, but not uniformly so, for all factors. More trouble- 
some were the significant decreases in the factor scores from the first to the second 
test administration. As the authors point out, this “natural” decrease would de- 
finitely have to be taken into account when estimating the effectiveness of thera- 
peutic intervention by this test. 

Hersen (94) presents additional results on the fear survey schedules. Female 
subjects, whether undergraduates or patients in inpatient or outpatient therapy 
situations, have higher fear scores than male counterparts. It is not clear whether 
this difference is to be ascribed to differences in fears, to the selection of items of 
the test, or to the difference in tendency to admit to having fears. Normals have the 
smallest number of fears, neurotics the next larger, and psychotics the largest. 
Reliability of the fear survey was very high on a split-half basis but not as high on 
a test-retest basis. Furthermore, schizophrenic subjects showed a significantly 
lower reliability than normals on a snake fear (.53 as opposed to .79 for normals). 
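
The two kinds of reliability mentioned above can be illustrated with a minimal sketch. It assumes that each subject's answers are stored as a list of item ratings, that the split-half coefficient is the correlation between odd-item and even-item totals from one administration, and that the test-retest coefficient is the correlation between total scores from two administrations. The function names and all ratings are hypothetical and are not drawn from any of the studies cited; no length correction is applied to the split-half value.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists.

    Raises ZeroDivisionError if either list has no variance.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def split_half_r(item_scores):
    """Correlate odd-item totals with even-item totals (one administration)."""
    odd_totals = [sum(subject[0::2]) for subject in item_scores]
    even_totals = [sum(subject[1::2]) for subject in item_scores]
    return pearson_r(odd_totals, even_totals)


def retest_r(first, second):
    """Correlate total scores from two administrations of the same schedule."""
    totals_first = [sum(subject) for subject in first]
    totals_second = [sum(subject) for subject in second]
    return pearson_r(totals_first, totals_second)


# Hypothetical five-point ratings: four subjects, four items each.
first_testing = [[1, 2, 1, 2], [3, 3, 4, 3], [5, 4, 5, 5], [2, 3, 2, 2]]
second_testing = [[1, 1, 1, 2], [2, 3, 3, 3], [4, 4, 5, 4], [2, 2, 3, 1]]

print("split-half:", round(split_half_r(first_testing), 2))
print("test-retest:", round(retest_r(first_testing, second_testing), 2))
```

A correlation coefficient is unaffected by a uniform drop in scores, so a high test-retest value can coexist with the kind of decrease from first to second administration noted earlier. Mean scores therefore have to be compared separately.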

Although the modification of self-assertive responses consists of the condition- 
ing of operant behavior, Wolpe has used it as part of the inhibitory mechanism in 
his reconditioning paradigm, and therefore we shall deal with it here. Rathus 
(202), inspired by the ever greater use of assertiveness training, devised a schedule 
of 30 items for its measurement. Both test-retest reliability and split-half reliability 
correlations were above .77. With respect to validity, Rathus had the testers choose 
a group of people they knew well and whom they could rate on a 17-item rating 
scale. Intercorrelation of the ratings with the subjects’ responses about themselves 
yielded a series of positive values, indicating that subjects who appeared to be 
assertive to friends revealed themselves to be so by their own responses on the 
self-assertiveness scale. Furthermore, the degree of assertiveness shown by 
subjects’ stories correlated positively (r = .70) with their scores on the assertiveness 
scale. 

It is not surprising that at least some behavior therapists were not satisfied with 
behavioral diagnoses on the basis of paper-and-pencil tests (69). Rutner & Pear 
(211) devised a behavioral measure of the avoidance of a feared object by measuring 
the length of time a subject is willing to observe a rat pressing a lever on a 
food reinforcement contingency. This length-of-observation measure distin- 
guished high from low phobic subjects as determined by how close they were 
willing to approach a rat in a cage. Craighead (41) related the proximity of a 
subject to a snake to verbal self-report of fear and to level of physiological arousal. 
The closer the subject was able to approach the snake, the less the physiological 
arousal measure and the greater the subjective fear measure. Unfortunately, the 
magnitudes of the correlations were not high. With respect to sexual problems, 
however, specific physiological measures, such as those of penile volume (4, 285), 
are more directly relevant. 


Methods of Behavioral Analysis Related to the Operant Conditioning Model 


Behavioral analysis is an intrinsic aspect of operant conditioning. Goldiamond 
(77) instructed his clients in behavior theory; then he told them to think of their 
own problems as an experiment in which they are trying to modify the probability 
of a particular class of behavior. 

The single most significant effort in the area of behavior analysis stems from 
Kanfer & Saslow (101), along with a later revision by the same authors (102). They 
present the following outline for eliciting information: 1. Analysis of the problem 
situation—is the behavior excessive or deficient? Under this category the therapist 
also looks for the patient’s behavioral assets. 2. Clarification of the problem 
situation—people involved in the behaviors and the consequences of the behav- 
iors. 3. A motivational analysis—a listing of the reinforcers, positive and negative, 
now impinging on the patient, those that could be used for therapy later, and those 
that might change if therapy is successful. 4. Developmental analysis—patient’s 
biological equipment and sociocultural background which may limit behavior, as, 
for example, a limp or blindness, poor education, or different language back- 
ground. These factors and characteristic behaviors at different times of life are 
investigated. 5. Analysis of self-control—inadequate or excessive. 6. Analysis of 
social relationships—the patient’s social networks reveal who can help and who is 
hindering the patient. 7. Analysis of the social-cultural-physical environment— 
comparison of the patient’s behavior to that expected by someone of that social 
background. Kanfer & Saslow explain that the information to be collected 
for this type of analysis need not all be garnered from the usual patient inter- 
view but, on the contrary, ought to be obtained by talking to significant others in 
the patient’s life, by watching him or her in interaction with those significant 
others, by observing that patient at work and at play, by interviewing people who 
might know about the patient, and also by employing paper-and-pencil tests and 
specific laboratory techniques as they are called for. 

In their book, Kanfer & Phillips (100) successfully relate the particular problems 
a patient might have (as ascertained from the above analysis) to the techniques of 
therapy that should be used, such as extinction and conditioning of competing 
responses. 

Storrow (260) has devised various check lists for his behavior analysis. The 
Personal Data Form, filled out by the patient, has questions such as: “What 
persons, situations, activities, etc seem to set these symptoms off or make them 
worse?” (p. 219). He completes the Psychiatric Case Study form on the basis of the 
Personal Data Form, a physical examination, information from relatives and the 
patient in the interview, and psychological testing. Although he calls the behaviors 
he wishes to modify symptoms, he views them as the dependent variables to be 
modified, rather than the epiphenomena of the medical model. The final part of 
his form notes his decision of what to change in the patient and by what method. 
Another variation on the Kanfer-Saslow behavioral analysis is the work by Stuart 
(261), in which forms are used following the S^D ... R → S^R paradigm. He categorizes 
what we have called S^Ds into four classes: 1. instructional stimuli; 2. discriminative 
stimuli (which Stuart restricts to mean an offer of positive reinforcement for a 
response); 3. potentiating variables which insure that the reinforcer is effective 
(for example, by depriving an individual of those things that will be given later as 
reinforcers); 4. facilitating stimuli (tools such as books if the subject is expected to 
read). Pomeranz & Goldfried (186) presented another variant of this basic kind of 
outline. Finally, Carter & Thomas (24) presented 27 categories of “problems in 
communication” in marital couples. These are for the most part operationally 
defined and consist of overtalk, fast talk, quiet talk, affective talk, abusive talk, 
opinion deficit, and topic avoidance, among others. 

A very important aspect of behavioral analysis is the use of observation by 
trained personnel, by the patient, or by a significant other. In many studies it has 
been shown that the act of observation by interested parties, whether the patient or 
significant others, increases self-control. In view of the frequent deficit in such 
self-control initially, it is perhaps not surprising that behavior should improve 
merely because it has been observed in an objective manner. Salzinger, Feldman 
& Portnoy (215) trained parents to use a form to record their observations of the 
behavior of their children by noting the children’s responses, their time of 
occurrence, the people present at that time, the occasion on which the behavior 
occurred, and the events that followed that behavior. Typical results showed that 
the observations were too vague at first, but with shaping through discussion of the 
recorded observations, the parents became more specific. At that point the parents’ 
behavior also changed, and occasionally the behavior of the target child improved 
before any special program was enacted. Apparently the requirement to record 
what was happening made the parent hesitate long enough at the time of such 
behaviors as tantrums so as not to reinforce them. In some cases a behavioral 
change is easy to record objectively, as in the case of weight loss. In most others, 
unfortunately, despite the therapist’s best intentions, the only measure available is 
the less easily verifiable patient-recorded behavior. Simkins (234) reported on the 
lack of data concerning reliability of such records. Electronic devices (227) might 
eventually yield indices of validity for such observations. 
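
The parent-recorded observation form described above can be pictured as a simple record with one field per item noted: the response, its time of occurrence, the people present, the occasion, and the events that followed. The sketch below is only an illustration of that structure. The class, the field names, the helper function, and the sample entries are all hypothetical and do not reproduce the form actually used by Salzinger, Feldman & Portnoy.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List


@dataclass
class Observation:
    """One recorded instance of a child's behavior (hypothetical layout)."""
    response: str              # what the child did
    time: str                  # when it occurred
    people_present: List[str]  # who was there at the time
    occasion: str              # circumstances in which it occurred
    consequence: str           # the event that followed the behavior


def consequences_of(records, response):
    """Tally the events that followed each recorded instance of one response."""
    return Counter(r.consequence for r in records if r.response == response)


# Invented entries, for illustration only.
records = [
    Observation("tantrum", "Mon 17:30", ["mother"], "before dinner", "given a snack"),
    Observation("tantrum", "Tue 17:45", ["mother", "sibling"], "before dinner", "given a snack"),
    Observation("tantrum", "Wed 20:00", ["father"], "bedtime", "ignored"),
    Observation("plays quietly", "Wed 16:00", ["mother"], "after school", "praised"),
]

print(consequences_of(records, "tantrum"))
```

Tallying consequences in this way shows at a glance which events most often follow a given behavior, which is the information needed to judge whether the behavior is being reinforced.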

One more type of assessment ought to be mentioned. In 1967, Cautela & 
Kastenbaum (25) devised a reinforcement survey schedule on which subjects may 
indicate the extent to which certain items give them “joy or other pleasurable 
feeling.” Examples are: eating, beverages, nude men, nude women, reading, 
singing, skin diving, and being praised by the boss for having completed a job well 
done. Thorndike & Kleinknecht (272) found that test-retest reliability over 5 
weeks ranged from .40 to .91 for various clusters of reinforcers. It is suggested that 
at least some of the clusters are sufficiently reliable for therapy research. 


Conclusion 


There are now a number of methods of behavioral analysis available. They 
include the usual techniques for obtaining information in the area of clinical 
psychology. They differ in the main from the older techniques in being more 
focused and in requiring fewer assumptions about the meaning of the responses 
obtained. Much more work needs to be done, however, before we can say that we 
have well-standardized methods of doing behavioral analysis. What we have so far 
are some specific tests and some general outlines of inquiry. 


2. THE MEASUREMENT AND CLASSIFICATION 
OF DEPRESSION 


Joseph L. Fleiss² 


After long neglect at the expense of research in schizophrenia, research in 
depression resumed actively in the early and middle 1960s. The research has been 
on many fronts: biochemical (224), epidemiological (131, 233), genetic (181), and 
clinical. Even though the unit of analysis in these research endeavors is invariably 
the individual depressed patient, the results are meant to be applied to depressives 
in general. This means that in each of these kinds of research, evaluation methods 
must be applied to measure a subject’s degree of depression (in order to screen out 
nondepressives as not suitable for study, to describe one’s resulting sample, or to 
measure change), and to classify him into one of a number of diagnostic categories 
(in order to ascertain which kinds of patients one’s results apply to, and to 
communicate one’s results to others). 

This review will be restricted to methods that have been employed to measure 
and classify depression. Attention will be limited as far as possible to so-called 
unipolar depression, i.e. depression as the primary current complaint with no 
previous episodes of mania or hypomania. Research into the distinction between 
unipolar depression and bipolar depression (i.e. manic-depressive illness) has 
been reviewed elsewhere (2, 283), as has been research into the etiology and 
treatment of depression (39, 258). Some of the signs and symptoms used to 
characterize depression are, in addition to a depressed mood, a loss of interest in 
usual activities, an inability to concentrate, psychomotor retardation or agitation, 
suicidal thinking, weight loss, and feelings of hopelessness. 


2 The author is also affiliated with the Division of Biostatistics, Columbia University, New 
York City. 
The Measurement of Depression 


RATING SCALES Rating scales most frequently employed for measuring depres- 
sion are of three kinds: self-report questionnaires to be completed by the subject; 
scales to be filled out by a professional after interviewing and observing the 
subject; and scales to be completed by a nurse or attendant on the basis of 
naturalistic ward observations. 

Self-report scales for depression have been developed by Beck et al (6), 
Pilowsky & Spalding (184), Rockliff (206), the Wakefield Hospital group (241), 
and Zung (296). The major advantage to relying on the subject to rate his own 
condition is the saving of professional time. Self-report scales are thus economical 
sources of data for large-scale surveys. They have been shown to be sensitive in 
assessing a patient’s degree of recovery from a depressive episode (190, 198), but 
not in estimating the severity of an acute episode experienced by a psychiatric 
inpatient (190). Their usefulness may be greater with outpatients and private 
patients. 

One disadvantage to self-report scales is that some subjects may be unable to 
understand the intent of the questions. Because of denial or deliberate falsifica- 
tion, the severity of depression is frequently underestimated by the subject him- 
self (178, 206, 275). In addition, many newly admitted inpatients are too 
acutely ill even to complete a short questionnaire within a few days of admission 
(250). 

An alternative procedure for measuring depression is a clinical interview of the 
subject, followed by or coincident with the completion of a rating scale. Scales for 
depression to be administered by clinicians or other professionals after an 
interview have been developed by Hamilton (86), Pilowsky and his colleagues 
(184), and Zung (298). Wechsler et al (277) developed a scale which relies on both 
sources, the patient’s self-report as well as a professional’s judgments. 

Spitzer & Endicott (248) have indicated how the interview is superior to other 
methods of data collection in psychopathology. They point out, however, that 
differences among interviewers in the topics they cover and in how they phrase 
their questions, and differences for the same interviewer in how he interacts with 
different subjects, can greatly affect the final ratings. Thus sizable proportions of 
the variance in scores from rating scales based on an interview may be associated 
with differences among interviewers rather than differences among subjects (58). 

In order to reduce this extraneous variation, structure may be imposed on the 
interview by specifying for the interviewer the order of topics to be covered, the 
questions to be asked, or even the probes to be put to the subject. Only Zung’s later 
scale (298), among those specifically designed for depression, seems to incorporate 
a structured interview. 

The self-report and interviewer-completed scales cited above limit the mea- 
surement of depression to that currently being experienced by a subject, and are 
thus useful for sample description and for establishing baselines in the study of 
change. They do not cover previous episodes or other aspects of psychiatric 
history. As pointed out by increasing numbers of investigators (3, 54, 205), how- 
ever, history is at least as important as current state for describing depression when 
one’s purpose is classification. Only one scale, a structured interview procedure 
developed by Spitzer & Endicott (249), appears to exist for covering thoroughly 
both current and past aspects of depression. The scale also has sections for manic 
and schizophrenic behavior. 

Ratings made by nurses or other ward personnel of patients’ overt behavior 
provide a useful adjunct to ratings based on an interview (18, 198, 199). They may 
even reveal patterns of behavior that are not picked up by other rating methods. 
Raskin and his colleagues (200, 201), for example, found a factor of excitement in 
nurses’ ratings of the ward behavior of depressed patients which was not present in 
either self-report or interview-based ratings. The nurses may have noted either 
transitory manic states or periods of extreme agitation that neither the patients nor 
the interviewers were aware of. 

A limitation on naturalistic observations of ward behavior is that they may be 
applied only to hospitalized patients and not to outpatients or nonpatients in the 
community. In addition, ward personnel may be insensitive to aspects of psycho- 
pathology (e.g. emotional withdrawal) which are revealed by the other sources of 
ratings (199). 

Each source of information—the patient himself, the interviewer, and the 
nurse—provides data about various aspects of depression, but these data are often 
in conflict. The multitrait-multimethod studies of Raskin and his colleagues 
(199-201) represent an attempt to dovetail these observations into a coherent 
portrait of depression. 

Although most of the scales cited above are intended to describe the depression 
currently experienced by psychiatric patients, some were designed for use with 
nonpatients (296, 298), and at least one was especially designed for describing 
course of illness (18). Before selecting a scale, therefore, the investigator must 
specify the kinds of subjects he intends to study, who is to make the ratings, and 
whether the data are to be analyzed cross-sectionally or longitudinally. 


SLEEP DISTURBANCE IN DEPRESSION A stereotype characterizes the place of sleep 
disturbance in depression. Neurotic depressives are supposed to experience diffi- 
culty in falling asleep, whereas psychotic depressives are supposed to experience 
early morning wakening. See below for a review of the distinction between 
neurotic and psychotic depression. A number of studies (85, 117, 209) have 
confirmed this classic picture. In a review of six studies contrasting the symptoms 
of neurotic and psychotic depression, Costello (38) found that early morning 
wakening was one of only two signs agreed upon by at least four of the studies as 
characterizing psychotic depression. 

Other studies (22, 40, 150, 153), however, have failed to confirm this stereotype. 
A major source of discrepancy between studies is a reliance on different kinds of 
information. Some studies have relied on the patient’s own recollection of the 
quality of his sleep (22, 85, 117), and others have relied on the observations made 
by nurses throughout the night (40, 150). The latter kind of study has tended to be 
negative, reporting small if any differences between neurotic and psychotic de- 
pressives. 

Two other studies (168, 185) have relied on both the patient and the nurse as 
sources of data, but their results disagree. One (168) reported good agreement 
between nurses’ observations and patients’ reports of the quality of sleep, and the 
other (185) that patients tend to report their sleep to be poorer than nurses 
perceive it. Weiss and his colleagues (278) found that neither nurses nor patients 
were able to agree well with information provided by electroencephalographic 
studies. 

The definitive description of the patterns of sleep disturbance in depression will 
only come from all-night EEG studies. Kales & Berger (99) reviewed the relatively 
small number of such studies performed through 1969. Their review suggests that 
neurotic and psychotic depressives have similar patterns of sleep disturbance, with 
the disturbances perhaps being more severe in the psychotic than in the neurotic 
group. 


RATER PERCEPTIONS Even if structure is imposed on the clinical interview, which 
is the most important source of information about depression, extraneous varia- 
tion will remain due to differences in how raters perceive what the subject reports 
and how he behaves (58). In one study (184), for example, ratings of depressed 
patients made by medical students relatively unsophisticated in psychopathology 
correlated much more strongly with patients’ self-reports (r = .91) than did rat- 
ings made by either experienced psychologists (r = .40) or psychiatrists (r = .36). 
The higher correlation achieved by the students may be attributed either to their 
being less influenced than the professionals by theoretical formulations, or to their 
naively accepting as fact, without further probing, whatever the patients reported. 

Reliance on standardized and structured procedures for assessing depression 
assures a reduction in, although not an elimination of, both bias (which might 
result in spurious differences) and unreliability (which might obscure real differ- 
ences) (108). Such reductions are especially likely in the context of a single study, 
for the raters can be trained in the comparable use of such procedures. 

The theoretical background of the rater is known, however, to affect strongly 
how he perceives and rates depressive symptoms (38, 51, 107, 108, 156, 179). No 
amount of standardization and structuring, therefore, will guarantee comparabil- 
ity across studies if the personnel employed in each perceive these symptoms 
differently. 


The Classification of Depression 


RELIABILITY OF DIAGNOSIS The major diagnostic subdivisions of the unipolar 
depressive disorders are neurotic depression, psychotic depression, and involu- 
tional melancholia (1, 74). On the basis of evidence reviewed by Zubin (291), as 
well as more recent data (37), one must conclude that the routine diagnosis of 
depression is grossly unreliable. 

In the original reports, data on diagnostic agreement were presented as simple 
proportions of agreement, uncorrected for the degree of agreement expected by 
chance alone. Because chance-expected agreement varies across studies as a 
function of the base rates with which the diagnoses are made, the uncorrected 
proportions are not comparable. A statistic, kappa, has been developed for measuring 
how well two (32) or more (59) judges agree beyond the level predicted by 
chance. 

Values of kappa less than or equal to zero imply that agreement is no better than 
that predicted by chance, and values greater than zero imply that agreement is 
better than that predicted by chance. Kappa assumes its maximum value of +1 
only when agreement is perfect, and is interpretable as an intraclass correlation 
coefficient (61, 127). 
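
As an illustration of why uncorrected proportions of agreement are not comparable across studies, the following sketch computes kappa for two judges from a table of diagnostic counts. It uses the usual definition, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed proportion of agreement and p_e is the proportion expected by chance from the two judges' base rates. The function name and both tables of counts are hypothetical, chosen only for illustration; they are not taken from any of the studies reviewed.

```python
def kappa_two_judges(table):
    """Chance-corrected agreement between two judges.

    table[i][j] is the number of patients assigned to category i by the
    first judge and to category j by the second judge.
    Undefined (division by zero) when chance-expected agreement equals 1.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    # Observed proportion of agreement: the diagonal of the table.
    p_observed = sum(table[i][i] for i in range(k)) / n
    # Base rates with which each judge uses each category.
    rates_first = [sum(table[i]) / n for i in range(k)]
    rates_second = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    # Agreement expected by chance if the judges rated independently.
    p_chance = sum(rates_first[i] * rates_second[i] for i in range(k))
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical counts for "depressed" vs "not depressed"; both tables
# show the same raw agreement (80 of 100 patients).
balanced = [[40, 10], [10, 40]]   # each judge diagnoses depression in 50%
skewed = [[5, 10], [10, 75]]      # each judge diagnoses depression in 15%

for label, table in [("balanced", balanced), ("skewed", skewed)]:
    print(label, round(kappa_two_judges(table), 2))
```

Both hypothetical tables show 80% raw agreement, yet kappa is .60 for the first and about .22 for the second, because chance-expected agreement is much higher when one diagnosis is rarely made.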

Three studies (7, 37, 221) report sufficient data on the reliability of the diagnosis 
of depression to permit the calculation of kappa. For neurotic depression, kappa 
varies from .10 to .47; for psychotic depression, from .19 to .30; and for involu- 
tional melancholia, from .21 to .38. When the subcategories are combined into the 
single diagnosis of depression, the resulting mean value of kappa is only in the 
neighborhood of .30. 

Serious cross-national differences exist in the criteria for diagnosing depression 
(37, 110, 230), with British psychiatrists tending to diagnose depressive disorders 
relatively more frequently than American psychiatrists. Equally serious are the 
biases against diagnosing depression in lower class patients in general (131) and 
blacks in particular (98, 144, 235). When blacks and whites are assessed by means 
of standardized procedures, the difference in the prevalence rate of depression all 
but vanishes (236, 273, 297). 

A study of New York hospital psychiatrists (83) revealed that they were sensitive 
to the depression exhibited by some of their patients, but were reluctant to bring 
this sensitivity to bear in making their diagnoses. A sample of patients was 
identified on the basis of a standardized interview conducted by independent 
investigators as suffering from severe depression, but with no signs of schizo- 
phrenia. These patients were treated with antidepressive therapy by the hospital 
psychiatrists significantly more frequently than were other kinds of patients, but 
were nevertheless more frequently diagnosed schizophrenic than depressive. 

The inability of clinicians to agree well on the diagnosis of depression, the 
systematic bias against diagnosing depression in certain kinds of patients, and the 
tendency among American psychiatrists to underdiagnose depression must be 
borne in mind as major reasons for disagreements among the studies to be 
reviewed below. 


THE DICHOTOMY BETWEEN NEUROTIC AND PSYCHOTIC DEPRESSION Of the three 
major subdivisions of the unipolar depressive disorders, involutional melancholia 
stands out as apparently not meriting status as a separate diagnostic entity. With 
the trivial exception of age, few differences of practical magnitude in etiology, 
clinical characteristics, or outcome have been reported between involutional melancholics and 
other depressives (2, 43, 106, 155). 

The validity of separating the other two subdivisions, neurotic depression and 
psychotic depression, however, has been the subject of intensive research and 
debate. An alternative dichotomy, reactive versus endogenous depression, overlaps
the neurotic-psychotic dichotomy to such an extent (5) that neurotic depression
and reactive depression are virtually synonymous, as are psychotic depression
and endogenous depression.

Reactive or neurotic depression is presumedly characterized by the existence of 
a precipitating event, anxiety in the clinical picture, difficulty in falling asleep, and 
poor premorbid adjustment. Endogenous or psychotic depression is presumedly 
characterized by the absence of a precipitating event, retardation, early morning 
wakening, and a history of previous depressive episodes. Personality variables 
appear not to be useful in distinguishing between the two types (68), nor perhaps is 
the existence of life-changing events prior to the onset of illness (271). 

Some investigators have either suggested (105, 154, 156) or reported data which 
suggest (116, 174) that psychotic or endogenous depression is a tightly organized 
syndrome representing a well-defined illness, whereas neurotic depression is a 
term applied to a heterogeneous collection of signs and symptoms frequently 
found together with other neurotic features, personality disturbances, and hys- 
teria. 

The validity of this view has been brought into question by Costello’s review 
(38) of six studies (22, 87, 117, 208, 209, plus an unpublished report) which 
factorially separated features of neurotic from features of psychotic depression. 
He found that the studies tended to agree on the variables loading on the factor for 
neurotic depression, but did not agree on the variables loading on the factor for 
psychotic depression. Only psychomotor retardation was reported by all six
studies as a symptom loading high on psychotic depression. Early morning 
wakening was cited as an important feature by four studies, and all other fea- 
tures were cited by three or fewer studies. 

The overwhelming importance of retardation in distinguishing between differ- 
ent kinds of depressed patients has been confirmed by other studies (62, 170). It is 
crucial, however, that psychomotor retardation be measured separately from flat 
affect (62), something not possible with all rating scales (142, 256). 

In addition to the studies reviewed by Costello (38), others (104, 116) have 
found that ratings of depressed patients factor either into a bipolar factor, with 
features of the two kinds of depression loading on opposite ends, or into two 
factors, one defined by signs of neurotic and the other by signs of psychotic 
depression. The inference frequently drawn from such results has been to confirm 
the validity of the diagnostic separation between neurotic and psychotic depres- 
sion. Such an inference is erroneous, for factor analysis is not by itself informative 
about the existence of types (51, 108, 120, 163, 207). 

One criticism leveled at the studies reporting bipolar factors has been that 
bipolarity may result from raters’ perceiving the two kinds of depression as 
distinct, and thus tending to rate patients as exhibiting either signs appropriate to 
one type or signs appropriate to the other, but rarely signs appropriate to both (51, 
107, 156). When factor analyses have been applied to ratings made by profes- 
sionals not committed to the dichotomy between the two kinds of depression, 
neither bipolar factors nor separate factors for neurotic and psychotic depression 
have been found (148, 165, 199).

More germane to the validation of a typology than the results of factor analyses
is an examination for clustering or multimodality of the frequency distributions of 
patients on quantitative variables (87, 163). Increasing numbers of investigators 
have been examining and reporting distributions of both factor scores and 
discriminant function scores best distinguishing neurotic depressives from psy- 
chotic depressives, but the results have been contradictory. Some studies have 
reported bimodality (22, 53, 73, 222), others have reported unimodality (105, 106, 
111, 170, 179, 207, 271, 294), and yet others have reported equivocal results (104, 
116). The same biases which give rise to bipolar factors may give rise to bimodal 
distributions, and such biases may explain some of the positive findings of bimo- 
dality. 

Another reason—a statistical one—may account for some of the negative and 
equivocal results. Multimodality is not a necessary condition for the existence of 
subtypes. Unless two underlying distributions are well separated or are present in 
nearly equal proportions, their mixture will result in a unimodal but skewed 
distribution (67). It is therefore inappropriate to examine a frequency distribution 
only visually, or to apply only goodness-of-fit tests for normality. A more powerful 
but more complicated procedure is to test whether a mixture of normal distribu- 
tions fits a given distribution significantly better than does a single normal distri- 
bution (60, 284). If the factor or discriminant function scores are weighted sums of 
sufficiently many variables, the model of underlying normality becomes tenable 
by the central limit theorem (60). 
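
The statistical point about mixtures can be checked numerically. The sketch below is an illustration only: it assumes two normal components with equal standard deviations, uses invented parameter values, and simply counts the local maxima of the mixture density on a grid. It is not the mixture-fitting test referred to above (60, 284).

```python
import math


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def mixture_pdf(x, p, mean1, mean2, sd):
    """Density of a two-component normal mixture; p is the weight of component 1."""
    return p * normal_pdf(x, mean1, sd) + (1.0 - p) * normal_pdf(x, mean2, sd)


def count_modes(p, mean1, mean2, sd, steps=4000):
    """Count local maxima of the mixture density on an evenly spaced grid."""
    lo = min(mean1, mean2) - 4.0 * sd
    hi = max(mean1, mean2) + 4.0 * sd
    xs = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    ys = [mixture_pdf(x, p, mean1, mean2, sd) for x in xs]
    return sum(1 for i in range(1, steps) if ys[i - 1] < ys[i] > ys[i + 1])


# Hypothetical parameter values chosen only to illustrate the argument.
well_separated = count_modes(p=0.5, mean1=0.0, mean2=4.0, sd=1.0)
equal_mix = count_modes(p=0.5, mean1=0.0, mean2=2.5, sd=1.0)
unequal_mix = count_modes(p=0.9, mean1=0.0, mean2=2.5, sd=1.0)
```

With these values the two equally weighted mixtures show two modes, whereas the 90:10 mixture with the same separation of 2.5 standard deviations shows only one, even though two distinct subpopulations are present by construction.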


NUMERICAL TYPOLOGIES OF DEPRESSION Rather than looking for modes in a 
frequency distribution as evidence for the existence of types, other investigators 
(80, 173, 174, 182, 183) have applied methods of numerical taxonomy (70, 141, 
163) to cluster depressed patients directly into homogeneous subgroups. Although 
some of these numerical typologies have been validated by demonstrations that 
members of different types respond differentially to treatment (96, 175) or perform 
differentially on other tests (176, 184), the results must be suspect because of 
deficiencies in most clustering methods (50, 63, 67), especially their tendency to 
yield artifactual types. Thus the demonstration of differential response to treat- 
ment may be little more than a demonstration of a correlation between response to 
treatment and one or more of the variables (e.g. retardation) most important in 
effecting the apparent separation between patients (60, 170). 
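
The tendency of clustering procedures to yield artifactual types can be illustrated with a deliberately simple case. The sketch below applies plain 2-means (Lloyd's algorithm), which is a stand-in chosen for brevity and is not one of the methods used in the studies cited, to simulated scores drawn from a single normal distribution. The function name, sample size, and random seed are assumptions made for the illustration.

```python
import random


def two_means_1d(values, iterations=50):
    """Plain 2-means (Lloyd's algorithm) on one-dimensional data."""
    low_center, high_center = min(values), max(values)
    low, high = [], []
    for _ in range(iterations):
        # Assign each value to the nearer of the two current centers.
        low = [v for v in values if abs(v - low_center) <= abs(v - high_center)]
        high = [v for v in values if abs(v - low_center) > abs(v - high_center)]
        new_low = sum(low) / len(low)
        new_high = sum(high) / len(high)
        if (new_low, new_high) == (low_center, high_center):
            break  # assignments no longer change
        low_center, high_center = new_low, new_high
    return low, high, low_center, high_center


# Simulated scores from ONE homogeneous population (no true subtypes).
rng = random.Random(0)
scores = [rng.gauss(0.0, 1.0) for _ in range(200)]
low, high, low_center, high_center = two_means_1d(scores)
```

The procedure returns two non-overlapping "types" with different means although none exist in the simulated population. Any outside variable correlated with the score would then appear to differ between the two groups, which is the interpretive hazard described above.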

One of the apparently more promising typologies of depression is that derived 
by Paykel (174). He found four types, one corresponding to psychotic depression 
and the others representing subdivisions of neurotic depression. A major difficulty 
in interpreting these types is that Paykel’s sample of depressives was selected from 
different sources: outpatient clinics, emergency services, day care centers, and the 
wards of mental hospitals. Social and cultural variables are known to affect
both the type of treatment patients seek (121) and the patterns of
psychopathology they present (103, 131). It was hardly surprising,
therefore, that differences in the patterns of depression were found among patients
from the various sources of Paykel’s sample (120, 177), and that his four
types overlapped considerably with the sources of his sample (176). One cannot
tell, therefore, whether his typology is one of depressed patients or of facilities 
treating depressed patients. 

Aside from the defect in Paykel’s sample selection, his work represents a 
paradigm for future research. Standardized observations of mental state and 
history should be made by professionals not committed to any theoretical model 
of depression, and should be used according to specified rules (205) to select 
subjects whose primary disorder is depression. 

The resulting data should be subjected to the most powerful analytic methods 
available. The Friedman-Rubin clustering method (70), the one used by Paykel 
(174), seems to be one of the best available. Within any study, however, one’s 
sample of diagnosed depressives must be derived from one source only (e.g. public 
mental hospitals or a probability sample of a community) rather than from many 
sources. It is better to describe one particular kind of depressed patient precisely 
than to describe many kinds of depressed patients ambiguously. 


3. PSYCHIATRIC EPIDEMIOLOGY 


Barry Gurland 


The generally healthy state of psychiatric epidemiology is evidenced by the tone of 
the review papers that have recently appeared on this subject (34, 36, 81, 123), by 
the scientific rigor of the papers in the international conference in Aberdeen (90), 
and by the appearance of the first textbook devoted exclusively to this topic (35). 

The Symposium on Psychiatric Epidemiology, held at Aberdeen University in 
July 1969, was the first of its kind in nearly a decade and produced almost 20 
papers of high standard, published as a volume in 1970 (90). Gruenberg (81), 
looking back to the pioneering Milbank Memorial Fund meeting on psychiatric 
epidemiology in 1949, and forward to developments in the near future, concludes 
that “the epidemiology of mental disorders has become an established category of 
research in the related scientific disciplines,” and he regards it as now inappro- 
priate, for epidemiological purposes, to view mental disorder in isolation rather 
than as one of the consequences of a variety of conditions, since the latter view 
provides a more useful unifying theme. Thus, in a wry compliment to the Aber- 
deen proceedings, he sees no further necessity for such meetings. Although that 
may be true for substantive studies, the methodological problems of assessing 
mental disorder seem sufficiently peculiar to this class of disorders to warrant its 
delineation as a special field for some time to come. 

Among the major recent advances in the methodology of psychiatric epidem- 
iology has been progress towards the development of reliable and valid mea- 
sures for assessing psychopathology in different national or cultural groups and 
across language barriers; for recording the interaction between patients and 
treatment services by means of case registers or record linkages; and for assessing 
forces in the biological or cultural milieu which might show associations with 
psychopathology.

As a measure of psychopathology, psychiatric diagnosis has been much improved
by the use of semistructured mental state interviews to standardize the
scope and style of collecting information for diagnosis and to allow an analysis and 
synthesis of the discrete items of psychopathology which relate to diagnosis; and 
by the use of glossaries to aid in the definition of diagnostic labels. The use of such
standard diagnoses and diagnostically oriented symptom analysis for epidemio- 
logical purposes has been demonstrated by the US-UK Diagnostic Project in a 
study of recent admissions to public mental hospitals in London and New York 
(37, 84). Diagnoses and symptom ratings made by the project’s research psychia- 
trists were shown to be cross-nationally reliable, whereas the routine diagnoses by 
hospital psychiatrists were based on different though overlapping criteria in the 
two cities. The New York hospital psychiatrists had a much wider concept of 
schizophrenia compared to their London colleagues so that the hospital statistics 
gave a misleading picture of the relative distribution of diagnostic groups in the 
two cities. 

Unfortunately there is no evidence as yet that semistructured interview tech- 
niques and glossaries of diagnostic terms have done much to improve the reli- 
ability of routine hospital diagnoses which are potentially the backbone of 
epidemiological work based on admission records or case registers. Kendell (109)
found that there was no appreciable change in the specification of diagnostic la- 
bels by psychiatrists in English mental hospitals after the promulgation of an 
official glossary for diagnostic categories in the mental disorders section of the 
International Classification of Diseases. 

This is not to beg the question of whether psychiatric diagnosis is necessarily the 
best reference point for epidemiological work. Some general population studies 
rely on inventories of symptoms which, though highly associated with psychiatric 
impairment, are not intended to cover the wide range of symptoms essential to the 
diagnostic process. However, when contrasting results are found between surveys 
based on diagnosis or diagnostic constellations of symptoms and those based on 
single symptom rates alone, then the most serious questions arise about the 
relative usefulness of these contrasting measures of psychopathology for the par- 
ticular purpose of that survey. Examples of such a contrast may be found in studies 
on the age distribution of depression where diagnostic surveys show the well- 
known bell-shaped curve of prevalence between 20 years and 60 years of age, 
while surveys based on symptom counts may show an age-prevalence curve 
resembling an inverted bell (71, 226). In terms of the purposes of psychiatric 
epidemiology (which have been described as serving the interests of public health 
efforts, clinical recognition of treatable cases, and research work into the etiology 
of psychopathology) it might seem that both diagnostic and symptom orientations 
are worth exploring, as well as other factors in case definition. Clearly a key area in 
epidemiological work is the identification of the intervening variables between 
symptoms and clinical diagnosis in a nonpatient population. 

Dohrenwend et al (44) have directed their attention to what they consider as the 
“central unsolved problem of psychiatric epidemiology—the problem of how to 
conceptualize and measure psychiatric disorder independently of treatment
status.” They collected data by means of two different interview techniques, one a
mental state interview with open-ended questions and the assignment of responses
to defined categories by rater judgment, and the other a screening interview for 
psychiatric impairment with prescribed response alternatives presented to the 
respondent. The interviews were conducted by psychiatrists on psychiatric pa- 
tients, community leaders, and a probability sample of five ethnic groups in a 
community. The investigators found, inter alia, that relations between judged 
disorder, ethnic status, and class status varied with the type of research interview 
and whether judgments were made face to face or by “blind” review of the 
protocols. Community subjects were judged to be more ill on the basis of protocol 
review than at face-to-face interview, and this discordance was greatest for com- 
munity leaders. The weighing of positive strengths that are observed in a face- 
to-face situation but not recorded is postulated as contributing to the discordance 
between interviewer-reviewer ratings of psychiatric disorder. 


Assessment Across Racial Boundaries 


The study cited above is a fine example of scientific carefulness and restraint in 
that the methodology is being painstakingly worked out before the substantive 
goal is tackled (in this instance, the relationship between social class and 
psychiatric disorder). This is not always the case. Often the substantive cart is put 
before the methodological horse, or methodological contributions are the result of 
secondary analyses of data collected for some other purpose. Thus it is not 
surprising that there have been many unconvincing statements and theoretical 
inferences in the literature about the differences between ethnic or racial groups in 
the frequency and forms of mental disorder. For example, it has been claimed that 
blacks, compared with whites, less often suffer from depressive disorders and more 
often suffer from schizophrenia; when depressed, the blacks are less likely to show 
guilt or suicidal trends and more likely to complain of somatic symptoms; and 
when schizophrenic they are more likely to show hallucinations. However, as 
Simon et al (236) remarked, “a major drawback to almost all the studies [until 
recently] is that, for data on psychopathology or diagnosis, they relied on hospital 
statistics, on the patients’ hospital records, or, in the few instances in which 
patients were actually interviewed, on the psychiatrists’ clinical impression.” 

Several recent papers have greatly improved the methodology in the assessment 
of psychopathology across racial boundaries. Simon et al (236) reanalyzed data 
from the US-UK Diagnostic Project in which semistructured mental state inter- 
views were carried out by research psychiatrists on recently admitted patients in 
public mental hospitals in London and New York. When the type and degree of 
psychopathology was controlled, a bias was revealed in the hospital diagnosis 
towards schizophrenic rather than depressive disorder in blacks compared to 
whites. When the research diagnosis was controlled there were no significant 
interracial differences in symptomatology among schizophrenics, but the black 
depressives reported significantly more anxiety, somatic complaints, and irritabil- 
ity than did the white depressives. 

Tonks et al (273) reanalyzed data from a study on depressed inpatients and
outpatients in New Haven, all of whom received semistructured mental state
interviews and a diagnosis of depressive disorder from a research psychiatrist. 
Significant interracial differences were found for several symptoms, but were 
much reduced when social class was controlled, and all but vanished when pa- 
tients were matched for severity of illness; only the symptom of feeling helpless 
remained significantly different between the races (being more frequent in whites 
than blacks). As a whole, the black group was less severely depressed than the 
white group, possibly because the black patients tended to seek treatment earlier 
in the course of their disorder than did the whites. 

Liss et al (140) and Welner et al (279) examined patients who had been admitted 
to a private hospital in St. Louis, Missouri. One hundred and nine of these patients 
were followed up and given a semistructured mental state interview an average of 
3 years later. Blacks, compared to whites, were found significantly more often to 
show delusions (of reference, bodily change, and grandeur) and hallucinations 
(both auditory and visual). These differences were regarded as characteristic of the 
racial groups rather than as diagnostic factors, partly because symptoms and 
diagnosis were more highly correlated in whites than in blacks. 

Leff (129) has provided a closely argued hypothesis that as cultures develop the 
members of the culture learn to express emotional states in a more differentiated 
form. Primitive expression of a limited variety of somatic experiences grows into 
sophisticated references to a wide range of emotional states based upon learning of 
the different social contexts in which the somatic experience can occur. He sug- 
gests that possibly this progression of emotional differentiation parallels the de- 
velopment of individual as opposed to group identity. In support of his basic 
hypothesis, Leff reanalyzed data from the International Pilot Study of Schizo- 
phrenia (289), in which a semistructured mental state interview was used for psy- 
chiatric evaluation in five “developed” and four “developing” countries, in each 
case in the appropriate indigenous language. The emotions of anxiety, depression, 
and irritability showed generally higher correlations (i.e. less differentiation) 
among patients within developing than within developed countries. Leff also 
reanalyzed data from the US-UK Diagnostic Project (New York sample) to show 
that among public mental hospital admissions blacks showed less differentiation 
than whites even when the patients were matched for social class. In a further data 
analysis, based on the reliability trials in the IPSS study, Leff (130) showed that his 
previous findings on the differentiation of emotional states were probably not an 
artifact of differences in the rating behavior of psychiatrists from developing and 
from developed countries. 
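
The index of differentiation used in this argument is the correlation between ratings of different emotions within a group of patients. A minimal sketch of that computation follows; the Pearson coefficient is assumed here as the measure of correlation, and the two groups of five patients with their 0 to 4 ratings are invented for illustration and are not taken from the IPSS or US-UK data.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length rating lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least two ratings")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a rating never varies")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical ratings of anxiety and depression for five patients per group.
group_a_anxiety = [0, 1, 2, 3, 4]
group_a_depression = [1, 1, 2, 4, 4]
group_b_anxiety = [0, 1, 2, 3, 4]
group_b_depression = [2, 0, 3, 1, 2]

r_a = pearson_r(group_a_anxiety, group_a_depression)
r_b = pearson_r(group_b_anxiety, group_b_depression)
```

On the reasoning summarized above, the higher correlation in the hypothetical group A (about .94, against about .14 in group B) would be read as less differentiation between the two emotions.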

It would appear that advances in method are enabling statements and theories 
on interracial differences in psychopathology to become more precise but no less 
challenging or conflicting than previously. It remains unclear whether conflicts 
between different studies result, on the one hand, from differences in methods of
data collection, rater bias, selection of the samples (e.g. in the referral
mechanisms for becoming a patient), or diagnostic criteria, or, on the other hand, from
cultural differences between the samples. However, an important product of this 
work has been to underline the importance of semistructured interviewing and
strict criteria for diagnosis as a means of reducing diagnostic errors in black
patients, particularly with regard to detecting affective disorders. 


Effects of Language on Assessment 


The study of the relationship between race and psychopathology often confounds 
the effects of language and culture. By way of contrast, the studies of Marcos et al 
(146, 147) demonstrate the care required in experimental design to enable the 
isolation of the effects of interview language on the evaluation of psychopathol- 
ogy. These investigators prerecorded on audiotape the questions in a mental state 
interview and replayed the tapes to ten bilingual psychiatric inpatients with a 
hospital diagnosis of schizophrenia. Half the initial interviews were in Spanish and 
half in English, with a crossover design for a second interview within 24 hours. The 
patients’ responses were videotaped and rated by English-speaking and by Spanish-speaking
psychiatrists. Higher levels of pathology were rated from the English than from the 
Spanish interview, even in those patients who had higher vocabulary scores in 
English than in Spanish. In a subsequent study, bilingual raters reviewed the 
videotapes and directly compared Spanish and English responses to the same 
questions. They noted several possible reasons for the higher scores in the English 
than in the Spanish interview, including frank contradictions (i.e. admits to path- 
ology in English but not Spanish interview), increased tenseness and uncom- 
municativeness in English, downgrading of somatic complaints by Spanish 
psychiatrists because of high frequency in the Spanish culture, and the infracon- 
tent structure of the patients’ speech in English resembling the features of depres- 
sion (slowness, pauses, etc). However, having said all that, the authors attempt to 
resolve the conflict between their results and those of some previous workers by
speculating on the interaction between language and the social context of the 
interview. Even this most careful study, because of its limited scope, leaves the 
generality of its findings in great uncertainty. 

Assuming that the psychopathology shown by a patient varies with the language 
in which he expresses his symptoms, there arises the question of whether the 
patient is more or less “sick” in one or the other language. Careful discussions of 
the different concepts and components of sickness have recently been given by 
Wing et al (281) and Mechanic (151). However, for the purposes of this review it is 
important to emphasize that the interview assessment is at best only a predictor of 
the patient’s behavior under the usual circumstances of his life, of his response to 
treatment, of his future progress, and possibly of the etiology of his disorder. These 
would seem to be the appropriate criteria to which we must ultimately refer in 
establishing the relative value or validity of cross-cultural variations in symptoms. 

Fortunately, there now exist translations from English into various foreign 
languages for most of the well-established techniques for reliably assessing psy- 
chopathology. For instance, the Present State Examination has been translated 
into Chinese, Czech, Danish, Hindi, Russian, Spanish, and Yoruba (289); the 
Psychiatric Status Schedule has been translated into Polish, Spanish, Italian, 
Hebrew, and Flemish (R. L. Spitzer & J. Endicott, personal communication, 
1974). For the most part comparability between the different language versions 
has been established by independent “back-translation.” An example of a carefully
designed and organized multilingual study is the International Collaborative
Study on the Efficacy of Loxapine, involving the use of English, French, and 
German versions of the Structured Clinical Interview (18a) for the assessment of 
psychopathology at initial and follow-up interviews. Such studies may contribute 
much to the identification of the predictive equivalence of symptoms between 
language groups. A comprehensive account of the problems and provisional 
solutions in cross-national social-psychological research is provided in a recent 
book by Manaster & Havighurst (145). 


Cross-Cultural Assessments 


Despite the many difficulties in assessing psychopathology cross-culturally, some 
solid ground is evident in that the symptom profiles of certain diagnostic cate- 
gories may be remarkably similar between different national, racial, and cultural 
groups. From this solid ground suggestions have been made for standardizing 
diagnoses by nominating a limited set of symptoms to define a group of patients 
whose diagnosis seems to meet with universal agreement, or by utilizing statistical 
techniques for reliably replicating the classification achieved by clinical diagnosis. 

Carpenter et al (23), reporting on data from the International Pilot Study of 
Schizophrenia collected by psychiatrists from centers in the nine participating 
countries, propose the use of 12 signs and symptoms of psychopathology as an 
operational definition of schizophrenia. In this study, 1202 patients recently hos- 
pitalized with evidence of psychosis were examined with a semistructured mental 
state interview and given a diagnosis according to the International Classification 
of Diseases. The wide range of items in the interview schedule were winnowed out 
by analysis of variance to find the strongest discriminators between cases diag- 
nosed schizophrenia and those given some other diagnosis. These discriminators 
were then subjected to a stepwise discriminant function analysis using the distinction
between schizophrenia and other diagnoses as the criterion. Twelve
symptoms or signs were thus selected for differential diagnosis. The more of these 
symptoms that were present, the more likely was the case to be diagnosed schi- 
zophrenia. Taking into account both the initial and the replication samples, the 
presence of four or more of these symptoms identified correctly over 90% of the 
diagnosed schizophrenics and included less than 40% of the cases receiving other 
diagnoses. The corresponding figures for the presence of six or more symptoms 
were about 65% correctly and 5% incorrectly predicted to be schizophrenic. These
promising results invite further work to establish whether or not the key items can 
be reliably rated by other research groups; whether the group of schizophrenics 
accurately identified in this way are more or less useful for research or clinical 
purposes than other groups of schizophrenics (e.g. for predicting outcome, treat- 
ment response, biological indices, genetic findings, and associations with possibly 
etiological factors); and whether nonschizophrenics can be similarly accurately 
identified for control groups in schizophrenia research. 
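
The trade-off between the two cutoffs can be made concrete. The sketch below uses invented symptom counts, not the IPSS data, to show how the proportion of diagnosed schizophrenics correctly identified and the proportion of other patients incorrectly included both fall as the required number of signs is raised. The function and variable names are hypothetical.

```python
def cutoff_rates(schizophrenic_counts, other_counts, cutoff):
    """Return (proportion of schizophrenics at or above the cutoff,
    proportion of other patients at or above the cutoff)."""
    if not schizophrenic_counts or not other_counts:
        raise ValueError("both groups must be non-empty")
    identified = sum(c >= cutoff for c in schizophrenic_counts) / len(schizophrenic_counts)
    misincluded = sum(c >= cutoff for c in other_counts) / len(other_counts)
    return identified, misincluded


# Hypothetical numbers of the 12 signs present in each of ten patients per group.
schizophrenic_counts = [4, 5, 6, 7, 8, 9, 6, 5, 3, 7]
other_counts = [0, 1, 2, 3, 4, 5, 1, 2, 6, 0]

rates_at_4 = cutoff_rates(schizophrenic_counts, other_counts, 4)
rates_at_6 = cutoff_rates(schizophrenic_counts, other_counts, 6)
```

In this invented sample a cutoff of four signs identifies 90% of the schizophrenic group while including 30% of the others; a cutoff of six identifies only 60% but includes 10%. The choice between cutoffs therefore depends on whether a study needs an inclusive or a narrowly defined group.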


Use of Case Registers 


If cross-sectional epidemiological studies are difficult enough, an even more 
complex and difficult task is the collection of data on a given population over time.
This aspect of epidemiology is vital for the description of the long-term course of
mental disorder, for chronicling utilization of services, for assessing incidence 
rates, for studying differences between generations (successive cohorts), and for 
relating etiological factors to consequences remote in time. Case registers and 
cohort studies are being used with growing sophistication for such longitudinal 
epidemiological inquiries. 

Richman (204) has recently reviewed the use of case registers of psychiatric care 
in epidemiological research. He defined case registers as “systems whereby records 
from a specified set of psychiatric facilities are collected for individual persons 
from a defined population and accumulated over time,” and noted that the 
usefulness of the register increased with the duration of its existence. As of now 
there are several case registers in English-speaking countries which have been in 
operation over a decade, including those in Aberdeen, Scotland; Dutchess County 
and Rochester in New York; and Camberwell, London. However, many registers 
have been born and abandoned in the past 15 years, largely because of their 
expense and because the effort sunk into data collection was not matched by an 
equal devotion to data analysis and practical utilization of the results. It is there- 
fore particularly encouraging that Wing and his colleagues (281) have demon- 
strated the rich potential of a register in a volume which reports the use of the 
Camberwell Psychiatric Register from 1964 to 1971 for monitoring and planning
the introduction of a community psychiatric service. Wing emphasizes that this 
register is a scientific instrument “to measure the extent and type of current use of 
services, to facilitate the examination of patterns of contact over time, to monitor 
changes accidentally or deliberately introduced, to allow estimation of future 
trends, to indicate researchable issues and to act as a sampling frame for more 
intensive studies.” 

Gruenberg (81) draws attention to the importance of “population laboratories” 
enabling longitudinal studies on cohorts of births, relating the course of the 
pregnancy to the child’s mental development. He refers to the birth registers or 
cohort studies in Birmingham, Newcastle, Cardiff, and Aberdeen. 

Although the Scandinavian countries have long since shown the value of a 
national network of records on mental health statistics, similar enterprise has been 
relatively recent in the United States and the United Kingdom. In the United 
States, Kramer, Chief of the Biometry Branch, National Institute of Mental 
Health, and his colleagues have organized and analyzed the national statistics on 
admission rates to mental hospitals and other relevant inpatient and outpatient 
facilities for the purpose, among others, of applying epidemiological methods to 
the planning and evaluation of community mental health programs. Their recent 
work has emphasized the importance of taking household size and composition 
into account in identifying groups at high risk for requiring psychiatric treatment 
(123); of age variation in the pattern of use of various types of psychiatric facilities 
(124); and of the need for improving the completeness and comparability of case 
findings and diagnostic and record keeping systems within and between countries. 
In the United Kingdom, the Department of Health and Social Security collates 
certain information, including diagnosis, on every case admitted to public psychiatric
hospitals in England and Wales. In view of the relatively uniform and high
standard of diagnosis in these hospitals, these data have proved very useful for
epidemiological inquiry as exemplified by the age distribution studies of Hare et al 
(89), Spicer et al (243), and McDonald (149). 


Biological or Cultural Milieu 


One side of the coin in psychiatric epidemiological research is the assessment of 
psychopathology so that its distribution in time and space may be plotted. The 
other side of the coin is the assessment of factors that might be associated with this 
distribution. 

Studies on family history, on genetic markers, and on associated disorders (e.g.
alcoholism) have advanced knowledge of the etiology of manic-depressive disorders
(157, 282). Refinement of indices for biological aging (14, 139) is making it
possible to reexamine biological concomitants of age variation in the distribution 
of depression and dementia. 

The assessment of the cultural milieu has reached a stage of sophistication at 
which substantive examination of the interaction between the ecological niche and 
psychopathology is possible. The concept of life events and their change has 
proved a powerful approach to isolating one of the pathogenic influences in the 
environment. 

The pioneering work of Holmes, Wolff, Rahe, and their co-workers in estab- 
lishing the association between stress associated with life events and illness, and 
the great volume of related work that followed, has led in the last few years to 
intensive study of the types and timing of life changes that are particularly likely to 
precede the onset of various psychiatric disorders (16, 46, 180) and to community 
surveys aimed at examining the relationship between the distribution of measures 
of psychiatric symptoms or disorders and of life events across various demo- 
graphic groups (167, 274a). 

Although this area of measurement is replete with methodological pitfalls (17), 
consistency in social consensus on the degree of change in given life events across 
cultures (191) makes a measure of life events a possible tool for cross-cultural and 
cross-national research. 


4. ATTEMPTS TO IMPROVE PSYCHIATRIC DIAGNOSIS 


Robert L. Spitzer and Jean Endicott³
Computerized Diagnosis 


The limitations of the clinical method for arriving at a psychiatric diagnosis, 
particularly the low reliability generally associated with routine clinical diagnosis, 
have led many investigators to explore the potential of computers for integrating 
the clinical observations into standard psychiatric diagnoses. Although some
efforts have been made to develop programs in which the patient interacts directly
with the computer via a console, the inherent limitations of this approach have led
most investigators in this area to employ models in which the basic observations of
signs and symptoms are still made by clinicians. These are the clinical observations
which are then introduced into a computer for summarization into a diagnosis.

3 Both authors are also affiliated with the Department of Psychiatry, College of Physicians
and Surgeons, Columbia University, New York City.

There are several advantages to this kind of computer-generated diagnosis. 
First of all, there is the value of the computer’s necessarily perfect reliability in the 
sense that, given the same data, the computer program will always yield the same 
diagnosis. Secondly, the computer program can utilize rules developed from a 
larger and more diverse sample of actual patients than any single clinician can 
command. In addition, the rules by which a computer assigns a diagnosis are 
explicit and public. Finally, empirically based rules constitute at least potential 
advances in our scientific understanding of the complex relationship between 
symptom characteristics and diagnosis. 

The most frequently employed models for computer diagnosis are either discriminant
function or Bayes classification (11, 57, 171, 192, 240). Both of these
models are empirically derived from data on patients for whom there are available
clinical diagnoses. The data are usually psychiatric symptomatology, but may include
demographic and historical information. These two statistical methods are
used to develop optimal classification rules for the particular sample to which they 
were originally applied, the “developmental” sample. The derived rules are then 
applied to classify new cases. The major limitation of these models is the require- 
ment of a very large sample size. Even with sample sizes in the thousands, which 
are rarely available, a very limited number of categories can be employed. A
second limitation is the lack of generalizability of the rules to samples which differ 
greatly from the developmental sample. 
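
The Bayes approach can be illustrated with a minimal sketch. It assumes binary symptom ratings that are conditionally independent given the diagnosis, a simplification that the studies cited may not have made; the diagnosis labels, symptom names, and smoothing constant are hypothetical. Rules are estimated from a small "developmental" sample and then applied to a new case.

```python
import math
from collections import defaultdict


def fit_bayes_rules(developmental_sample, smoothing=1.0):
    """Estimate priors and symptom rates from (diagnosis, {symptom: bool}) pairs."""
    n_cases = defaultdict(int)
    n_present = defaultdict(lambda: defaultdict(int))
    symptoms = set()
    for diagnosis, ratings in developmental_sample:
        n_cases[diagnosis] += 1
        for symptom, present in ratings.items():
            symptoms.add(symptom)
            if present:
                n_present[diagnosis][symptom] += 1
    total = sum(n_cases.values())
    priors = {d: n / total for d, n in n_cases.items()}
    # Smoothed probability that each symptom is present, given the diagnosis.
    rates = {
        d: {s: (n_present[d][s] + smoothing) / (n_cases[d] + 2 * smoothing)
            for s in symptoms}
        for d in n_cases
    }
    return priors, rates


def classify(priors, rates, ratings):
    """Return the diagnosis with the highest posterior probability."""
    def log_posterior(diagnosis):
        score = math.log(priors[diagnosis])
        for symptom, p in rates[diagnosis].items():
            if symptom in ratings:
                score += math.log(p if ratings[symptom] else 1.0 - p)
        return score
    return max(priors, key=log_posterior)


# Hypothetical developmental sample (not data from any study cited here).
SAMPLE = [
    ("diagnosis A", {"symptom 1": True, "symptom 2": False}),
    ("diagnosis A", {"symptom 1": True, "symptom 2": False}),
    ("diagnosis B", {"symptom 1": False, "symptom 2": True}),
    ("diagnosis B", {"symptom 1": False, "symptom 2": True}),
]
PRIORS, RATES = fit_bayes_rules(SAMPLE)
```

With so few cases per category the estimated rates are unstable, which is the sample-size limitation noted above in miniature.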

The Bayes method has been applied to psychiatric classification by Birnbaum & 
Maxwell (11), Overall & Gorham (171), and Smith (240). The discriminant func- 
tion method has been used by Rao & Slater (193), Melrose, Stroebel & Glueck 
(152), and Sletten, Altman & Ulett (237). Sletten and his group have applied this 
method to five hospitals in the Missouri automated record keeping system and 
developed a method for classifying patients into one of 12 diagnostic groups. The 
overall agreement between computer classification and clinical diagnosis was 
sufficiently high for them to make the system operational in the Missouri Standard 
System of Psychiatry used in Missouri state hospitals. 

An alternative to the statistical models is the logical decision tree approach 
developed by Spitzer & Endicott (245, 246, 251). In this approach, which emulates 
the clinical differential diagnostic procedure, the computer program consists of a 
sequence of questions. The answer to each question is either “true” or “false” and 
rules out one or more diagnoses and determines which question is to be examined 
next. Some questions may ascertain the presence of a single critical sign or 
symptom, others whether a numerical score is in a certain range, and yet others 
whether a complex pattern of both signs and symptoms is present. The logical 
decision tree approach has the advantage of being independent of any specific 
body of data although, of course, not of accumulated clinical and research experience.
For this reason the method can be used with different psychiatric populations,
since it is not designed to be optimal for a particular population of psychiatric
patients.
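
The structure of such a program can be suggested by a toy sketch. It is not the DIAGNO logic: the questions, thresholds, and diagnosis labels below are hypothetical and serve only to show the three kinds of question described (a single critical sign, a score in a range, and a complex pattern), each answered true or false and each determining the next question.

```python
def run_decision_tree(ratings, tree, start="q1"):
    """Follow true/false branches until a leaf (a diagnosis label) is reached."""
    node = start
    while node in tree:
        question, if_true, if_false = tree[node]
        node = if_true if question(ratings) else if_false
    return node


# Hypothetical tree: each entry is (question, next if true, next if false).
TOY_TREE = {
    # Presence of a single critical sign.
    "q1": (lambda r: r.get("critical_sign", False), "diagnosis A", "q2"),
    # A numerical scale score falling in a certain range.
    "q2": (lambda r: 10 <= r.get("scale_score", 0) <= 20, "q3", "no diagnosis"),
    # A complex pattern of both signs and symptoms.
    "q3": (lambda r: r.get("sign_x", False) and r.get("symptom_y", False),
           "diagnosis B", "diagnosis C"),
}
```

Because the rules are written down rather than estimated, nothing in the tree depends on a particular developmental sample.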

The two statistical models and the logical decision tree model have been 
compared on the same data base (64). The logical decision tree model performed 
as well as the two statistical models (in terms of agreement with clinical diagnosis) 
on a large cross-validation sample which was similar to the developmental sample. 
The logical decision tree method performed better than the two statistical ap- 
proaches on a sample drawn from a totally new population. 

The logical decision tree approach has been employed in three programs of 
increasing complexity: DIAGNO, DIAGNO II, and DIAGNO III, by Spitzer &
Endicott. The validity issue has been studied primarily by examining the agree- 
ment between the computer diagnoses and the diagnoses of expert clinicians, since 
the purpose of the program was to simulate expert clinical practice. When the 
computer and the clinician were limited to the same information (from research 
protocols of psychiatric ratings), the agreement between the computer and clinical 
diagnosis was as good as that between the diagnoses of pairs of clinicians (245, 
246). However, in a study in which the clinicians had direct access to the patient or 
a narrative case record, the agreement between the clinicians exceeded agreement 
between the computer and the clinicians. The output of these programs has been 
judged to have sufficient validity for use in various research projects for describing 
samples of subjects, selecting subjects for experiments, in epidemiological and 
cross-cultural studies, and investigating problems in classification. 

Dissatisfaction with the treatment relevance of standard diagnoses has led some 
investigators to employ computers to develop treatment-relevant typologies 
rather than to simulate the practices of expert clinicians using the standard 
nomenclature (93, 119, 172, 238). Each of these programs has been shown to have 
some value in predicting which patient will respond best to a limited number of 
drugs in samples similar to those used to develop the program. The general utility 
of such programs across samples of patients who differ from the samples from 
which the programs were derived has yet to be demonstrated. 

In so far as the computer is used to simulate expert clinical practice in making 
a standard psychiatric diagnosis, the major constraints on further developments in 
computerized diagnosis appear to lie in limitations in the standard nomenclature 
itself. Although improvements in the raw data and the computer algorithms may 
lead to some improvement in computer diagnosis using the current ambiguous 
criteria, a quantum jump in the validity of computer diagnosis is dependent upon 
improvements in the system itself. 


Specified Criteria for Psychiatric Diagnosis 


In the usual diagnostic manual, such as the American Psychiatric Association’s 
Diagnostic and Statistical Manual (DSM-II), diagnostic stereotypes are described
under each category. The task of the diagnostician then becomes one of selecting 
the diagnostic category in which the stereotype most closely resembles the characteristics
of the patient being diagnosed. In addition, the standard nomenclature
includes many categories for which there is little validity evidence in terms of
discriminability from other conditions, long-term follow-up studies, specific 
treatment response, typical course, and elevated familial incidence. 

The Renard hospital group in St. Louis (54) has developed a diagnostic system
which limits itself to those conditions for which it believes there is good research
evidence for the validity of the conditions as diagnostic entities. The system
provides for explicit criteria which are in the form of sets of items, a specified
number of which are necessary or sufficient to make the diagnosis. A consequence
of this approach is the recognition of the need for an “undiagnosed psychiatric
disorder” category for the patients who do not meet any of the specified disorders,
yet exhibit significant psychopathology.


Table 1 Specified criteria for major depressive illness (A through E required)


A. Dysphoric mood characterized by symptoms such as the following: depressed, sad, blue,
hopeless, down in the dumps, empty, “don’t care,” irritable, fearful, worried. The dysphoric
mood must be prominent and relatively persistent but not necessarily the most dominant
symptom.


B. At least 5 of the following symptoms for definite and 4 for probable (for past episodes
because of memory difficulty, the criteria are 4 and 3 symptoms):

1. Poor appetite or weight loss (2 lbs. a week or 10 lbs. or more a year when not
dieting) or increased appetite or weight gain.

2. Sleep difficulty or sleeping too much.

3. Loss of energy, fatigability, or tiredness.

4. Psychomotor agitation or retardation (but not mere subjective feeling of restlessness or
being slowed down).

5. Loss of interest in usual activities, or decrease in sexual drive (do not include if limited
to a period when delusional or hallucinating).

6. Feelings of self-reproach or excessive or inappropriate guilt (either may be delusional).

7. Complaints of or evidence of diminished ability to think or concentrate, such as slow
thinking or mixed-up thoughts (do not include if associated with obvious thought
disorder).

8. Recurrent thoughts of death or suicide, including thoughts of wishing to be dead.


C. Depressive features of illness lasting at least 2 weeks.


D. Sought help from someone during the dysphoric period or had impaired functioning
socially with family, at home, or at work.


E. None of the following which suggests schizophrenia is present:

1. Delusions of control or thought broadcasting of any duration if definitely present.

2. Hallucinations throughout the day for several days, or intermittently throughout a 1
month period unless content is clearly related to lowered or elevated self-esteem.

3. Auditory hallucinations in which either a voice keeps up a running commentary on the
patient’s behavior as it occurs, or 2 or more voices converse with each other (of any
duration as long as definitely present).

4. At some time during the period of illness had delusions or hallucinations for more than
1 month in the absence of prominent affective (manic or depressive) symptoms
(although typical depressive delusions, such as delusions of guilt, sin, poverty, nihilism,
or self-deprecation or hallucinations of similar content are permitted).

5. Preoccupation with a delusion or hallucination to the relative exclusion of other
symptoms or concerns (other than delusions of guilt, sin, poverty, nihilism, or
self-deprecation or hallucinations with similar content).

6. Definite instances of schizophrenic thought disorder.

Spitzer, Endicott, and Robins (personal communication, 1974), as part of a 
collaborative study of the psychobiology of depression, have expanded the Ren- 
ard system to include other categories and have modified and refined the specified 
criteria in an effort to improve reliability. An example of the criteria for one 
category, Major Depressive Illness, is shown in Table 1. 
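
Because the criteria in Table 1 are explicit counting rules, they can be written as a short decision function. The following is a sketch only: the function name and argument names are hypothetical, and the clinical judgments behind each argument (for example, whether the dysphoric mood is "prominent and relatively persistent") are assumed to have been made by the rater beforehand.

```python
def classify_major_depressive_illness(dysphoric_mood, b_items_present,
                                      duration_weeks, sought_help_or_impaired,
                                      exclusion_present, past_episode=False):
    """Apply criteria A through E of Table 1.

    b_items_present: numbers (1-8) of the criterion B symptoms judged present.
    exclusion_present: True if any criterion E item is present.
    Returns "definite", "probable", or None when the criteria are not met.
    """
    items = {i for i in b_items_present if 1 <= i <= 8}
    # Past episodes use lower thresholds because of memory difficulty.
    definite_min, probable_min = (4, 3) if past_episode else (5, 4)
    required = (dysphoric_mood                  # criterion A
                and duration_weeks >= 2         # criterion C
                and sought_help_or_impaired     # criterion D
                and not exclusion_present)      # criterion E
    if not required:
        return None
    if len(items) >= definite_min:              # criterion B
        return "definite"
    if len(items) >= probable_min:
        return "probable"
    return None
```

A patient with dysphoric mood, symptoms 1, 2, 3, and 7, three weeks of illness, impaired functioning, and no excluding features would be classified "probable" for a current episode and "definite" for a past one.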

The reliability of these diagnostic criteria has been evaluated based on both 
interviews of patients and case records. The reliabilities obtained with 104 newly 
admitted psychiatric inpatients from four participating facilities were generally 
considerably higher than those reported in other diagnostic reliability studies 
(254). The study of case records used 120 patients from the New York State 
Psychiatric Institute and permitted a comparison between the reliabilities ob- 
tained when research assistants used the specified criteria and when a different set 
of experienced clinicians used the official nomenclature of the American Psychiatric
Association. The results of the case record study are shown in Table 2,
using a measure of agreement called kappa, which indexes agreement for nominal
categories corrected for chance agreement (244). Not only are virtually all of the
reliabilities higher for the research assistants using the specified criteria, but the
values obtained are higher than those ever reported in diagnostic studies of
heterogeneous samples of patients.


Table 2 Kappa coefficients of agreement for major diagnosis on 120 case records


                                   Kappa using    Kappa using
Diagnostic category                specified      DSM-II
                                   criteria       criteria

Major groupings
  Affective                        .70            .48
  Schizophrenia                    .84            .48
  Other                            .14            .61
Specific categories
  Nonaffective schizophrenia       .70            .49
  Schizoaffective schizophrenia    .24            .37
  Manic illness                    .76            .37
  Major depressive illness (a)     .66            .23
  Minor depressive illness (b)     .56            [illegible]
  Alcoholism                       [illegible]    – (c)
  Drug dependence                  1.00           – (c)
  Obsessive-compulsive             .65            .42
  Anxiety neurosis                 [illegible]    .06
  Phobic neurosis                  .66            [illegible]
  Borderline psychosis             .20
  Undiagnosed                      .59


(a) Considered equivalent to any psychotic affective illness in DSM-II.
(b) Considered equivalent to Neurotic Depression in DSM-II.
(c) Neither of the raters used this category.
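
The chance correction can be made concrete with a short sketch of the usual two-rater kappa, (observed agreement − chance agreement) / (1 − chance agreement). Whether the study cited used exactly this variant is an assumption; the function name and the example ratings are hypothetical.

```python
from collections import Counter


def kappa(rater_1, rater_2):
    """Two-rater kappa for nominal categories, corrected for chance agreement."""
    if len(rater_1) != len(rater_2) or not rater_1:
        raise ValueError("both raters must diagnose the same, nonempty set of cases")
    n = len(rater_1)
    # Proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_1, rater_2)) / n
    # Agreement expected by chance from each rater's category frequencies.
    freq_1, freq_2 = Counter(rater_1), Counter(rater_2)
    chance = sum(freq_1[c] * freq_2[c] for c in freq_1) / n ** 2
    if chance == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - chance) / (1.0 - chance)


# Hypothetical ratings of four case records by two raters.
RATER_1 = ["affective", "affective", "schizophrenia", "schizophrenia"]
RATER_2 = ["affective", "affective", "schizophrenia", "affective"]
```

Here the raters agree on three of four records (.75), but half of that agreement would be expected by chance, so kappa is .50.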

To further increase the reliability of this set of diagnoses, an instrument has 
been developed by Spitzer and Endicott, called the Schedule for Affective 
Disorders and Schizophrenia (SADS). The SADS has a structured interview and a 
set of rating scales to enable a clinician to collect the information necessary to 
make the diagnostic distinctions provided in the list of diagnostic criteria. In 
addition, a computer program will be developed which will use the specified 
diagnostic decision rules to classify patients on the basis of the individual items. It 
is quite clear that one way of increasing reliability of diagnoses is to provide more 
specific criteria for each diagnostic category. 


5. A PSYCHOPHYSIOLOGICAL APPROACH 
TO PSYCHOPATHOLOGY 


Samuel Sutton 


The rationale for applying an experimental approach to mental disorders derives 
in part from the need for objectifying and measuring the observations of the 
clinician on the characteristics and behavior of patients and, more deeply, from 
the hope of finding some kind of indicator of difference in central nervous system 
functioning that might define a group of patients. The concern of this section is 
primarily with the latter aim. In remarking some time ago on the subtlety and 
complexity of the etiological factors in mental illness, as well as on the kinds of 
artifact which may arise in biochemical research involving the analysis of body 
fluids and tissues, Kety (112) noted that it would be highly desirable if some clues 
as to what should be looked for could be supplied to the biochemist from other 
fields—not least of all, psychology. 

It might seem paradoxical that one would turn to psychological rather than 
physiological responses when one’s primary goal is to obtain some reflection of 
central nervous system functioning. However, to the extent that the physiological 
variables are sensitive to the psychological state of the subject (263), no advance is 
made merely by moving from the psychological to the physiological level. Thus it 
is true that schizophrenic patients may be shown to have unusual evoked potential 
waveforms to stimuli whose identity they did not know in advance (133). However, 
from the point of view of providing etiological clues, such findings may be viewed 
as redundant support of available psychological data, e.g. reaction time, which 
have shown that the response of schizophrenic patients to the condition of uncer- 
tainty is different from that of normal subjects (266). Both the physiological and 
psychological data share a common problem of validity. In neither case do we 
know whether the difference between groups is in the way they respond to uncertainty,
or whether other differences between groups, such as the level of
attention or motivation, determine the findings. The problem of validity can be
solved at the psychological level by eliminating or controlling the effect of
extraneous psychological processes on the behavior on which the experimenter is
traneous psychological processes on the behavior on which the experimenter is 
focused. Only then might the obtaining of physiological correlates of deviant 
performance be illuminating with respect to etiological questions. 

The fundamental difficulty in the interpretation of findings arises from the fact 
that patient groups almost universally perform more poorly than normals. But it is 
the sources of the poor performance that are unclear. It may in fact be true that the 
patient has a deficit in information processing, or in perceptual sensitivity, or in 
psychomotor function. However, the poor performance may have nothing directly 
to do with these functions and may arise for quite different reasons. The patient 
may be less cooperative or less attentive. He may be concerned with his troubles, 
may be preoccupied with his hallucinations, and may care very little about this 
apparently meaningless task we put before him. Given the disturbed state of the 
patient, the complexities of the task may be too much for him. Paradoxically we 
may obtain erroneous results because the patient is too motivated, too anxious, 
and too concerned with the results of his performance; as is well known, a very
high degree of anxiety and motivation often leads to poor performance. The
problem in obtaining valid data in these situations is in a sense not logically
different from the problem of controlling for “faking” or malingering, which has
been most often raised in relation to responses to questionnaires (228).

Several strategies have been developed for coping with some of these problems 
and obtaining more valid indicators of central nervous system differences. These 
will now be presented (98a, 264). 


Simplicity of the Task 


Simple reaction time can be used to reflect fairly subtle discriminations between 
stimuli (242, 257). The subject’s task is only to lift his finger as rapidly as possible 
when one of several stimuli is presented. No discrimination is required in this 
procedure. Nevertheless, a finding of differences in reaction time to different 
stimuli inescapably leads to the inference that the nervous system is capable of 
appreciating the differences among them. The advantage in such a strategy is that 
the same stimuli presented as a discrimination may for whatever reason prove to 
be too difficult or too subtle a task for certain groups of patients. This would lead to 
the usual uninterpretable “poorer” performance. 

However, the reaction time of schizophrenic patients, and to some extent of
most hospitalized psychotic patients, is generally slower than that of normals to all
stimuli. This brings into play a second strategy.


Own-Control Design 


The rationale of the own-control approach is based on the hope that confounding 
or trivial factors enter equally into the two or more measures being compared. This 
would presumably permit the ruling out of confounding factors by the use of 
difference scores or some other maneuver. While this can be a powerful tool, it can 
also lead to serious misinterpretations. For example, if the measures being used
involve tasks of different degrees of difficulty, then it is possible that the level of
difficulty can interact with differences between groups in motivation or attention. 
Performance on the difficult task may be more severely handicapped by low 
motivation than performance on the easy task; thus a difference between groups in 
motivational level would make performance on the easy task an inappropriate 
correction factor for comparison of groups on the more difficult task. Further- 
more, this process of correcting for performance at one level by performance at 
another level has a number of other pitfalls, and one cannot with impunity take 
difference scores or use even covariance analysis indiscriminately in all cases (66, 
264). 

Patients and normals must be tested on identical experimental conditions to 
avoid bringing range effects into play (187). Furthermore, care must be used to 
randomize experimental conditions, by trial if possible, so that subjects cannot 
assume different attitudes to various experimental conditions, e.g. the attitude that 
“this is a challenging condition I will try harder”; or “this is a very difficult 
condition, I give up on this one”; or “this is an easy condition, I don’t have to try so 
hard”; and so on. Even with all these provisos, a series of sequential experiments 
may be necessary to establish the interpretation of a particular finding (188). For 
example, Sutton & Zubin (266) reported that, relative to normals, schizophrenic
patients show a greater lengthening of reaction time when the immediately preceding
trial is in a different sensory modality than when it is in the same sensory
modality, even though the stimulus of the previous trial occurs as long as several
seconds earlier. This appears to imply a difference in bias or expectancy or
segmental set on the part of the patients. However, in a subsequent experiment it 
was shown that the greater crossmodal retardation continues to be found in such 
patients even when the subject is told prior to each trial what the sensory modality 
of the stimulus will be (275a). 


Controlling the Criterion Variable 


Signal detection theorists (79) have been able to demonstrate that they can obtain
two independent measures in threshold situations: one is a measure of sensitivity
called d’ and the other is a measure of criterion⁴ called β. The basic advance
consists of the fact that it can be demonstrated both logically and experimentally
that these two measures can be obtained independently of each other. This is an
important step forward since many patients tend to be more cautious than normals.
It is indeed this cautiousness which accounts for the earlier reports that
schizophrenic patients were less sensitive in their ability to detect flicker. What
Clark, Brown & Rutschmann (28) have shown is that, rather than being less
sensitive, the patients appear to have a need for seeing definite flicker before they
are willing to say that the light is flickering. With classical psychophysical methods
such a bias yields an apparently poorer sensitivity. When measures of sensitivity
are obtained which are independent of the criterion variable, there is no difference
between schizophrenic patients and normals in sensitivity to the rate at which light
and darkness must alternate before flicker is detected. The pioneering papers by
Clark and his colleagues (27, 28, 31) have cut the underpinnings from a great deal
of research comparing the sensitivity of patients and normals which was done by
classical psychophysical techniques. If the difference in performance is in fact due
to the criterion variable, then it is the criterion variable and not sensitivity which
requires further study.

4 Beta is the term more commonly used to refer to the signal detection measure of criterion.
Actually, what is usually computed from hits and false alarm rates is Lx. For the difference
between Lx and β, see Swets, Tanner & Birdsall (267).
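
The independence of the two measures can be shown with a minimal sketch. It assumes the equal-variance Gaussian model of signal detection theory, under which sensitivity is the difference between the z-transformed hit and false alarm rates and the criterion measure is the likelihood ratio at the observer's cutoff. The function names and the two simulated observers are hypothetical and do not represent data from the studies cited.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def sensitivity_and_criterion(hit_rate, false_alarm_rate):
    """Return (d', likelihood-ratio criterion) under the equal-variance model."""
    for rate in (hit_rate, false_alarm_rate):
        if not 0.0 < rate < 1.0:
            raise ValueError("rates must lie strictly between 0 and 1")
    z_hit = _STD_NORMAL.inv_cdf(hit_rate)
    z_fa = _STD_NORMAL.inv_cdf(false_alarm_rate)
    d_prime = z_hit - z_fa
    # Ratio of signal to noise densities at the observer's cutoff.
    criterion = math.exp((z_fa ** 2 - z_hit ** 2) / 2.0)
    return d_prime, criterion


def rates_for_observer(d_prime, criterion_shift):
    """Hit and false alarm rates for a simulated observer (illustration only).

    A positive criterion_shift models a more cautious observer.
    """
    hit = _STD_NORMAL.cdf(d_prime / 2.0 - criterion_shift)
    false_alarm = _STD_NORMAL.cdf(-d_prime / 2.0 - criterion_shift)
    return hit, false_alarm


# Two hypothetical observers with equal sensitivity but different caution.
NEUTRAL = sensitivity_and_criterion(*rates_for_observer(1.0, 0.0))
CAUTIOUS = sensitivity_and_criterion(*rates_for_observer(1.0, 0.5))
```

The cautious observer has a lower hit rate, which a classical threshold method would read as poorer sensitivity, yet both observers yield the same d’ and differ only in the criterion measure.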

Several other investigators have since reported that the use of signal detection 
methods yields a finding of no differences between groups. Milner et al (158) 
found no differences in either d’ or β for the detection of auditory tones in noise by
obsessional and nonobsessional patients. However, in one experimental condition 
which permitted the subjects to ask for repeat observations of the signal before 
making a decision, it was found that obsessional patients asked for repeats more 
than four times as often. Clark & Rubin (30) found that while the constant 
stimulus method showed that the psychiatric patients (who were older) had poorer 
thermal sensitivity than the normals, a signal detection theory analysis of the same 
data showed no differences in d’, but rather differences in the criterion variable. 
The psychiatric patients were more conservative. 

Not all use of signal detection methods has led to negative results with respect to 
sensitivity differences between psychiatric or retarded groups and normals. 
Korzh & Vaytkyavichyus (122) found differences between schizophrenic patients 
and normals in alpha suppression and electromyographic responses to auditory 
stimuli presented in a signal detection task. Kuechler & Dodwell (128) found that 
while normals increased their d’ for the detection of a tone in noise under the stress 
of electric shock, neurotic and schizophrenic patients showed a decrement in d’ 
under these conditions. Boissonneault et al (13) found d’ differences in the resolution
of paired flashes only under a high exercise-induced activation level (but
not under moderate or low activation levels) between extroverts with a high
neuroticism score and extroverts with a low neuroticism score. Suboski (262) 
found signal detection theory of use in the analysis of results of published experi- 
ments which investigated classical discrimination conditioning in relation to rat- 
ings on manifest anxiety. 

In a comparison of retarded and normal adults, Bernstein & Day (10) found 
differences between groups in size constancy with both classical and signal detec- 
tion psychophysical methods. In the Price & Eriksen (189) study of size constancy, 
differences were found between nonparanoid schizophrenic patients, paranoid 
schizophrenic patients, and normal controls with the signal detection measure, 
despite the findings of no differences with the classical psychophysical measures. 
When criterion is uncontrolled, no real inference about sensitivity can be made; 
therefore, we should not be surprised by whatever permutation of positive and 
negative results we obtain with classical psychophysical and detection theory 
methods. 

In psychiatric illness, differences between groups in the criterion variable are 
more often than not viewed as an artifact which interferes with the evaluation of
differences in sensitivity. Signal detection methods, however, have also been
applied where the primary goal of the study was to look for criterion differences 
between groups, rather than for sensitivity differences. In one study, Carney (21) 
found no differences between delusional and nondelusional schizophrenic pa- 
tients in their response to the use of monetary payoff to manipulate the criterion 
level. The assessment of the effects of aging presents a contrast to the assessment of 
psychiatric illness. While investigators of aging are also concerned with the evalu- 
ation of sensitivity, the hypothesis that criterion differences such as conserva- 
tiveness or cautiousness are associated with aging has been of particular interest to 
investigators. Craik (42) reviewed the evidence from his own earlier study as well 
as the available literature which used signal detection methods. While support was 
found for high βs (greater cautiousness) to be positively associated with age, there
were also negative and contradictory findings. Evidently the nature of the task
enters as a factor in determining whether or not a relationship with age will be
found as well as the direction of the relationship (29, 30, 78, 203).

The evaluation of the effects of drugs on performance in both normal subjects 
and patient groups raises many of the same difficulties that comparison of normal 
and patient groups raises. Here, too, the problem which arises is whether the drug 
has affected the subject’s sensitivity, his criterion, his attention level, his motiva- 
tion, etc. An increasing number of drug studies using signal detection theory 
methods in an attempt to disentangle some of these effects are beginning to appear 
(48, 49, 82, 143, 164, 194-197, 225, 290). 

Problems of comparing patient groups with normals are by no means all solved 
by the use of signal detection methods. Thus Rappaport et al (196), who used 
signal detection methods, discounted a finding of poorer sensitivity in nonparanoid
schizophrenic patients than normals when they found that at still lower
stimulus intensities where the discrimination is more difficult, the same patients 
were no less sensitive than the normals. They argue that such contradictory 
findings reflect the fact that signal detection methods cannot cope with the vari- 
able attention level characteristic of certain patient groups. However, strictly 
speaking, they did not use a signal detection format since a “no” response was 
defined as a failure to make a response (48). 


Accuracy Indicators 


While the subject’s response in the “yes-no” or one-interval signal detection 
format is a forced choice in that the subject must make a decision, other forced- 
choice designs have different properties. Thus in the three-interval forced choice, 
the subject must in each trial decide whether the stimulus was in the first, second, 
or third observation interval. Such a procedure differs from the one-interval 
detection theory procedure in two important ways. First, it provides the subject 
with a comparison in each trial—namely, the observation interval in which the 
stimulus was present compared with the two observation intervals in which the 
stimulus was not present. Second, the subject is in effect being forced to respond 
“present” following each trial, eliminating the usual avenue for expressing cau- 
tiousness or laxness of criterion. Thus the three-interval forced-choice procedure is 
criterion free; criterion in the usual sense does not enter nor can it be measured. 
But it should be noted that it is not free of response bias. The subject may not 
distribute his responses equally among the three observation intervals (although 
the stimuli are equally distributed). When this occurs, it can be corrected for by 
examining the hit rate separately for each interval, thus correcting for the subject’s 
response bias. In a spatial forced choice, Calfee (20) showed that subjects did not 
utilize the end positions and the center positions in the same way. This was a 
short-term memory task and subjects remembered stimuli in the end positions 
better than stimuli in the center positions. As a result, when they were uncertain 
they more often picked center positions. This bias was the same for retarded as for 
normal children. 
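
The per-interval check described above can be sketched in a few lines of Python. This is a minimal illustration and not the procedure of any study cited here; the function name `per_interval_summary` and the trial data are hypothetical. It tabulates how often each observation interval was chosen and the hit rate for each interval taken separately.

```python
from collections import Counter


def per_interval_summary(trials, n_intervals=3):
    """Summarize forced-choice trials interval by interval.

    trials: iterable of (stimulus_interval, response_interval) pairs,
    with intervals numbered 1..n_intervals.
    """
    trials = list(trials)
    presented = Counter(s for s, _ in trials)      # where the stimulus was
    chosen = Counter(r for _, r in trials)         # where the subject pointed
    hits = Counter(s for s, r in trials if s == r)  # correct choices
    summary = {}
    for k in range(1, n_intervals + 1):
        summary[k] = {
            # Share of all responses given to interval k (response bias).
            "response_share": chosen[k] / len(trials),
            # Hit rate computed only over trials with the stimulus in k.
            "hit_rate": hits[k] / presented[k] if presented[k] else None,
        }
    return summary


# Hypothetical data: stimuli equally distributed, responses favoring interval 1.
example_trials = [(1, 1), (1, 1), (2, 1), (2, 2), (3, 1), (3, 3)]
example_summary = per_interval_summary(example_trials)
```

An unequal `response_share` across intervals signals response bias even though criterion in the usual sense does not enter; comparing the separate hit rates keeps that bias from being read as a difference in sensitivity.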

As with signal detection methods, few investigators are using forced-choice 
methods to compare groups. Emmerich & Levine (48) found that schizophrenic 
patients had less sensitive auditory thresholds than normals. Bruder et al⁵ found 
that only patients with affective psychoses had less sensitive auditory thresholds 
than normals. The sensitivity of schizophrenic patients was found to be within the 
normal range. Hare (91) found that psychopathic criminals were less sensitive in 
detecting electric shock than nonpsychopathic criminals. 

Broen & Nakamura (15) compared the auditory sensitivity of acute paranoid 
and chronic nonparanoid schizophrenic patients under distraction and no dis- 
traction. They found no difference under the no-distraction condition, but under 
distraction the sensitivity of the chronic nonparanoid patients deteriorated while 
that of the acute paranoid patients did not. Neale et al (169) found that there were 
no differences in the ability of normals, good premorbid paranoid schizophrenic 
patients, and poor premorbid nonparanoid schizophrenic patients to identify 
whether a tachistoscopically displayed letter was a T or an F. However, when the 
signal letters were imbedded in an 8 letter matrix, both schizophrenic groups 
performed more poorly than the normals. Stilson & Kopell (259) compared 
normals, nonschizophrenic, and schizophrenic patients in their ability to identify 
geometrical figures against a background of visual noise as opposed to no visual 
noise. All forced-choice thresholds were elevated by visual noise; the schizo- 
phrenic thresholds were most elevated, the nonschizophrenic patients were in- 
termediate, and the normals were least elevated. Smith (239) reported that chronic 
schizophrenic patients showed an impairment in short-term memory when com- 
pared with nonschizophrenic patients. Also see Shallice & Warrington (229) for a 
study of short-term memory in a case of conduction aphasia. 

Walker & Birch (276) had schizophrenic and nonschizophrenic children 
between ages 7-11 select from three visual dot patterns the one which best 
matched an auditorily presented temporal pattern. Schizophrenic children at each 
age made fewer correct matches. 


5 G. E. Bruder, S. Sutton, H. Babkoff, B. J. Gurland, A. Yozawitz, J. L. Fleiss & M. L. 
Kietzman. Auditory signal detectability and facilitation of simple reaction time in psychiatric 
patients and nonpatients. Unpublished manuscript. 
Better Patient Performance 


While the use of forced-choice methodology controls the criterion problem very 
well and also gives us a safeguard against falsely attributing to patients a better 
performance than they can actually accomplish, it does not solve the fundamental 
problem. One can argue very persuasively that the patients who have poorer 
thresholds than normals were simply less attentive or less motivated than normals. 
Presumably, according to this argument, had the patients been in a state which 
permitted them to be fully attentive and cooperative they might have obtained just 
as good thresholds as the normals. There is only one satisfactory answer to this 
argument. If we could demonstrate in a particular task that patients were able to 
perform better than normals, it would then not be possible to attribute the better 
performance for that task to poorer attention and cooperation on the part of the 
patients. There are so far only a few instances of reported better performance on 
the part of patients. 

Bruder et al (see footnote 5) presented randomly either a single click which was 
25 dB above each subject’s absolute threshold, or the 25 dB click followed 15 msec 
later by a click which was 10 dB above the subject’s absolute threshold. The subject 
lifted his finger from a key as rapidly as possible in response to the presentation of 
either stimulus package. They found that reaction time of normal subjects to the 
paired clicks was only very slightly faster than reaction time to the 25 dB click by 
itself. However, for patients with affective psychoses, reaction time to the paired 
clicks was much faster than reaction time to the 25 dB click by itself. Here then is 
evidence that these patients were benefiting more from the presence of the 10 dB 
click than were the normals. The patients with affective psychoses were more 
“sensitive” to the presence of the 10 dB click and in that sense were performing 
better. 

Collins (33) presented an even more convincing case of better performance. 
There were two 2 msec flashes of light, which in one stimulus package were separ- 
ated by 2 msec giving a total duration of 6 msec, and in another stimulus package 
were separated by 0 msec, that is, a continuous 4 msec flash of light. The two stim- 
ulus packages had the same total amount of light energy; however, the energy was 
distributed differently over time. As is known from Bloch’s Law, if the total 
stimulus duration of both packages is shorter than a value known as critical 
duration, then the level of response to these two packages of light will be identical. 
If, however, one of the stimulus packages exceeds critical duration for the class of 
response under consideration, then the level of response to the two equal-energy 
packages will be different. These two stimulus packages were presented in random 
sequence, and subjects lifted their finger as rapidly as possible whichever stimulus 
package appeared. For normals and nonschizophrenic psychiatric patients no 
reaction time differences were found between the two stimulus packages. How- 
ever, schizophrenic patients showed significantly longer reaction times to the 6 
msec package of light than to the 4 msec package of light. From this it is inferred 
that for schizophrenics, critical duration is shorter than 6 msec. Presumably, for 
this group some portion of the 6 msec package is not fully integrated. It therefore 
contains less effective energy and reaction time is longer. 
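
The inference can be made concrete with a toy calculation. The sketch below assumes a deliberately simple integration model that is not taken from the studies cited: only light falling within a window of length equal to the critical duration, measured from the first onset, counts as effective energy. The function name `effective_energy` and the 5 msec and 10 msec windows are hypothetical values chosen for illustration.

```python
def effective_energy(flashes, critical_duration, intensity=1.0):
    """Energy integrated within `critical_duration` msec of the first onset.

    flashes: list of (onset_ms, offset_ms) pairs of equal intensity.
    Assumption (illustrative only): light arriving after the window
    closes contributes nothing.
    """
    start = min(onset for onset, _ in flashes)
    window_end = start + critical_duration
    total = 0.0
    for onset, offset in flashes:
        # Portion of this flash that falls inside the integration window.
        overlap = max(0.0, min(offset, window_end) - onset)
        total += intensity * overlap
    return total


separated = [(0, 2), (4, 6)]   # two 2 msec flashes, 2 msec apart (6 msec total)
continuous = [(0, 4)]          # one continuous 4 msec flash

# Window longer than both packages: equal effective energy.
long_window = (effective_energy(separated, 10), effective_energy(continuous, 10))
# Hypothetical window shorter than 6 msec: the separated package loses energy.
short_window = (effective_energy(separated, 5), effective_energy(continuous, 5))
```

Under this assumption the two packages are equivalent whenever the window covers both, and the 6 msec package yields less effective energy once the window is shorter than 6 msec, which is the pattern the longer reaction times are taken to reflect.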

These experimenters (33, 264) make no claim that a shorter critical duration is 
biologically “better” than a longer critical duration—in fact, the reverse may well 
be true. But whichever is functionally more advantageous, the schizophrenics in 
this design make a discrimination, in the sense of different reaction time to the two 
packages, which normals are unable to make. (The two stimulus packages are also 
perceptually identical for normals;⁶ comparable perceptual studies are not avail- 
able for patients.) It would be difficult to attribute such a group difference to 
defective attention or cooperation on the part of the patient group. 

Two studies of behavior at a much more complex level report findings which 
may also be interpretable as better performance by patients. Goldfarb & Braun- 
stein (75) reported that the speech of schizophrenic children is less disturbed than 
the speech of normal children by a delay in auditory feedback while speaking. 
Levine et al (132) reported a similar finding for schizophrenic adults. However, an 
earlier study in schizophrenic adults by Sutton et al (265) reported negative results. 


Iterative Procedures 


All the above strategies are addressed to the design of the tasks that the subjects 
perform. However, there is at least as serious a problem in defining the indepen- 
dent variable, namely the group to which each subject belongs. For the definition 
of patient groups, there is the problem of the lack of objectivity of the criteria and 
the lack of agreement among clinicians. Less well recognized is the fact that a 
“normal” population may often contain a significant proportion which is far from 
normal. For the classification of both patients and normals, the use of standard- 
ized rating scales which have good reliability is a distinct advantage. Kriegel 
(126) reported that in her samples only half of the hospital-diagnosed schizo- 
phrenics were diagnosed schizophrenic by rating scale procedures. Even more sur- 
prising is the fact that only half her normals, recruited through the usual channels, 
were diagnosed as normal by the rating scale procedures. When her group 
comparisons were confined to the “pure” schizophrenics and the “pure” normals, 
not only were the reported differences in crossmodal retardation larger, but also 
intragroup variability was found to be smaller. Along similar lines, in the Collins 
study (33), it was those schizophrenics who rated high in “speech disorganization” 
who demonstrated shorter critical duration. By similar post hoc analyses, Satter- 
field (223) showed that it was depressives with a family history of mental illness 
who were most different from normals in their evoked potential waveforms. 
Because such findings are post hoc discoveries, they must be subjected to 
replication before being accepted. Sutton (264) has argued that it is through such a 
series of sequential experiments that the experimental psychologist may make the 


6 M. L. Kietzman, R. Rutschmann & E. Shapiro. Visual temporal integration: A compar- 
ison of discrimination and reaction time to pairs of suprathreshold flashes. Unpublished 
manuscript. 
greatest contribution toward the “purification” of groups and thus to the problems 
of diagnosis and etiology. A series of sequential experiments permits the utiliza- 
tion of an “iterative” strategy. For example, after replication of the initial finding 
with respect to critical duration, the measure of critical duration can then be used 
as the independent variable in selecting subjects to see whether it is only schizo- 
phrenic “speech disorganization” that is associated with shorter critical dura- 
tion. Other objective, physiological, and clinical techniques can then be brought to 
bear in the attempt to define and describe subpopulations of patients who by other 
procedures are inseparable from poorly defined larger categories. 
SUMMARY AND CONCLUSIONS 


Joseph Zubin 


With the tools described in the prior sections we can proceed to examine some of 
the scientific models that have been proposed for the etiology of the mental 
disorders. 


Descriptive Model 


In the descriptive area we have made considerable progress in demonstrating that 
systematic structured interviews of high reliability can be successfully applied to 
answering such questions as (a) whether more schizophrenic than affective dis- 
orders are admitted to the hospitals in London than in New York (see p. 637); 
(b) whether diagnoses arrived at by computers are comparable in reliability to 
those arrived at by clinicians (see section by Spitzer & Endicott); (c) whether 
patients who differ from their peers in the processing of information in the central 
nervous system show any characteristic difference in their psychopathology (see 
pp. 654-55); and (d) whether different treatments affect psychopathology differen- 
tially (95). 

In addition to the quantitative approaches some notable progress has also been 
made in the historical documentation of qualitative description of psychopatho- 
logical phenomena. Thus Sir Aubrey Lewis (134-136), one of the collaborators on 
the bilateral studies of diagnoses in the United States and the United Kingdom, has 
traced the varying phenomenological description of the terms of anxiety, para- 
noia, and endogenous-exogenous historically and indicated their clinical signifi- 
cance in each period, including their present connotations. He has similarly 
described the development of the diagnostic schemas (137). Fleiss (62), on the 
other hand, with the help of his clinical colleagues has succeeded in developing a 
dimensional analysis of psychopathology based on the results of systematic inter- 
views which yields measures of anxiety separated from depression, and of flatness 
of affect separated from retardation. These separations were rarely achieved 
before in the factor analytic literature, and only a combination of clinical acumen 
supported by factor analytic flair succeeded in purifying these factors. They have 
already demonstrated their usefulness in separating clinically based categories of 
patients. 

The diagnostic categories themselves have been subjected to reliability investi- 
gation. Thus, in an examination by the US-UK project of a sample of American 
patients whose interviews were tape-recorded, a second rater reviewed the tapes 
and applied a second diagnosis (65). The overall agreement as measured by kappa 
for the primary diagnosis given by each diagnostician was .51; the agreement 
coefficients for schizophrenia and for affective psychosis were both .59. When 
secondary diagnoses were considered, the value of kappa rose to .64 for overall 
agreement, to .91 for schizophrenia, and to .68 for affective disorders. 
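
For readers unfamiliar with the statistic, kappa expresses agreement beyond what chance alone would produce. The sketch below assumes Cohen's unweighted form for two raters; the source does not state which variant was computed, and the function name `cohen_kappa` and the diagnostic labels are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted kappa for two raters who each label the same cases."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of cases")
    n = len(rater_a)
    # Observed proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / n ** 2
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical diagnoses: S = schizophrenia, A = affective, O = other.
first_rater = ["S", "S", "A", "A", "O", "S", "A", "S"]
second_rater = ["S", "A", "A", "A", "O", "S", "S", "S"]
example_kappa = cohen_kappa(first_rater, second_rater)
```

In this made-up example the raters agree on 6 of 8 cases (.75), chance agreement is 26/64, and kappa is therefore 11/19, or about .58: raw agreement overstates reliability when a few categories dominate.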

A second estimate of reliability was obtained by noting the distribu- 
tion of diagnoses made by the New York team on 96 American patients 
whom they interviewed and by the London team on 49 American patients whom 
they interviewed. Since these two groups of patients were unselected and 
could be regarded as a random sample of the total, the remarkable degree 
of agreement in the distribution of diagnoses attests to their high degree of reli- 
ability. It still remains an open question whether the diagnoses made by the 
state hospital clinicians in the United States and the corresponding area hospital 
clinicians in London differed because of differences in sensitivity to the presence 
of schizophrenic vs affective psychopathology or because of preference in style of 
diagnosis. Accepting the project diagnosis as the correct diagnosis, the signal 
detection theory method was applied to determining whether the two groups of 
clinicians differed in sensitivity or in criterion. The difference in sensitivity turned 
out to be very small, but significant differences in the criterion—a greater readiness 
to apply the label of schizophrenia in the United States than in the United 
Kingdom—were noted. Apparently American psychiatrists are as sensitive as the 
English psychiatrists to the presence of psychopathology of the schizophrenic and 
affective type, but the greater readiness of the Americans to apply the schizoph- 
renic label and the greater readiness of the British to apply the affective label 
produced the observed difference in national statistics (W. C. Clark, personal 
communication, 1974). 
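
The separation of sensitivity from criterion used in this analysis can be illustrated as follows. The sketch assumes the standard equal-variance Gaussian model of signal detection theory, with a "hit" defined as applying the schizophrenia label when the project diagnosis is schizophrenia; the rates shown are invented for illustration and are not the project's data.

```python
from statistics import NormalDist


def dprime_and_criterion(hit_rate, false_alarm_rate):
    """Return (d', c) under the equal-variance Gaussian model.

    Rates must lie strictly between 0 and 1.
    """
    z = NormalDist().inv_cdf          # inverse of the standard normal CDF
    z_hit, z_fa = z(hit_rate), z(false_alarm_rate)
    d_prime = z_hit - z_fa            # sensitivity
    criterion = -0.5 * (z_hit + z_fa)  # negative = readier to say "yes"
    return d_prime, criterion


# Hypothetical rates for two groups of clinicians.
lenient_group = dprime_and_criterion(0.84, 0.31)
strict_group = dprime_and_criterion(0.69, 0.16)
```

With these invented rates the two groups have the same d' (about 1.49) but criteria of opposite sign, which is the kind of pattern reported here: equal sensitivity, different readiness to apply the label.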

Perhaps one of the most striking recent developments in the use of these 
interviews is the 9-country investigation of schizophrenia in centers throughout 
the world (289). Through a variety of statistical procedures, 12 items from the 
systematic structured interview (Present State Examination) were found charac- 
terizing the group of patients who were given the label of schizophrenia in these 9 
centers. In a replicated study, the presence of 8 or more positive symptoms out of 
the 12 identified about one-fifth of the schizophrenics (four-fifths false negatives) 
but not a single nonschizophrenic scored that high. Adopting a more lenient cutoff 
point at a score of 6 or more, two-thirds of the schizophrenics are correctly 
identified (only one-third false negatives) and only 5% of the nonschizophrenics 
(false positives) fall into the net. It must be remembered, however, that the 
psychiatrists making these diagnoses, though indigenous to their own particular 
cultures, were all trained in the West and were viewing the patients through a 
Western-tinted screen. Nevertheless, the presence of similar groups of patients 
across this wide cultural spectrum would argue for the ubiquity of schizophrenia 
and presents a quandary for those who regard the diagnostic nomenclature as a 
myth. The items forming the 12 point scale included: 1. absence of waking early, 
depressed facies and elation, which represent affective components unfavorable to 
a diagnosis of schizophrenia; 2. presence of delusions (widespread, bizarre, nihil- 
istic, and thoughts aloud); 3. disturbance of communication (incoherent speech, 
poor rapport, and unreliable information); and 4. poor insight and restricted 
affect. 
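
The trade-off between the strict and the lenient cutoff can be shown with a small calculation. The sketch below is illustrative only: the function name `rates_at_cutoff` and the symptom scores are hypothetical, constructed to resemble the proportions quoted above rather than drawn from the 9-country data.

```python
def rates_at_cutoff(scores_target, scores_other, cutoff):
    """Proportion of each group scoring at or above `cutoff`.

    Returns (hit_rate, false_positive_rate).
    """
    hit_rate = sum(s >= cutoff for s in scores_target) / len(scores_target)
    false_positive_rate = sum(s >= cutoff for s in scores_other) / len(scores_other)
    return hit_rate, false_positive_rate


# Hypothetical scores on a 12-item scale (not the study's data).
schizophrenic_scores = [9, 8, 7, 7, 6, 6, 6, 5, 4, 3]
nonschizophrenic_scores = [6] + [0, 1, 2, 3, 4, 5] * 3 + [2]

strict_rates = rates_at_cutoff(schizophrenic_scores, nonschizophrenic_scores, 8)
lenient_rates = rates_at_cutoff(schizophrenic_scores, nonschizophrenic_scores, 6)
```

Lowering the cutoff from 8 to 6 raises the hit rate in this made-up sample from .20 to .70 while the false positive rate moves from zero to .05, the same direction of exchange described for the actual scale.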


Etiological Models 


The etiological models that have been proposed for the mental disorders range 
from field-theory-based models, embedded in the ecological niche that the person 
occupies, to atomistic (biological) models of the genetic variety. Leaning towards 
the ecological model but also containing biological elements are two psychologi- 
cally based models (the developmental model and the learning theory model), while 
the two physiologically based models (the internal environment model and the 
brain function model) lean more in the direction of the atomistic approach. 

This composite review has dealt with the learning theory model in the section by 
Salzinger and with the neurophysiological model in the section by Sutton. We 
shall not deal with these further, but instead will devote some attention to the 
ecological, developmental, genetic, and internal environment models. 

The ecological model for the etiology of mental disorders assumes that their 
cause is to be sought in the stressors affecting the ecological niche which the person 
occupies. A full taxonomy of the parameters of the ecological niche and the stress 
they produce is not yet available, but such factors as low socioeconomic status 
have been implicated either as causes or as effects of mental disorder. A current 
investigation by the Dohrenwends (45) may throw light on this question. Assum- 
ing that old American stock who are now found in the low socioeconomic strata 
have gravitated there because of their lack of capacity to rise with the upward 
mobility thrust of the general population, they would tend to have higher rates of 
mental disorder than the newly arrived immigrants who contain a large proportion 
of capable people in their midst who have not yet had the opportunity to rise. On 
the other hand, if we assume that the stressful environment is the cause of mental 
disorder, we would expect the newly arrived immigrants who have risen to the 
higher levels to have more mental disorder than their peers of the old American 
stock who have not had to struggle to rise. 

Perhaps the most striking evidence for the possible role of the ecological model 
comes from H. B. M. Murphy (166), who has found that whenever ethnic groups 
containing both Protestant and Catholic subgroups are compared by religious 
subdivisions for first admission rates for schizophrenia, the Catholic subgroup 
invariably has the higher rate. Whether this is attributable to Catholic upbringing 
or whether some artifact such as greater utilization of hospitals is the basis for this 
finding is not yet determined. 
With regard to the developmental model, Garmezy (72) has recently reviewed 
the entire field. Among the most vocal proponents of the role of the developmental 
model, especially with regard to family structure, is Lidz (138). He makes a good 
case for the degree of warping that may occur in families of the deviant type, but 
unfortunately provides no normal control groups for contrast. The dire role that 
family interaction plays in the development of schizophrenia has recently been 
deflated by Manfred Bleuler’s long-term study of 208 schizophrenic probands and 
their families (12). It is interesting to note that Manfred Bleuler, who grew up at 
the Burghölzli Hospital, found no help in diagnosis or in understanding his 
patients from the Rorschach technique which was administered to all the pro- 
bands and their relatives. One of the more striking findings in the developmental 
model is the lack of intimacy which characterizes the adolescent friendship pattern 
of preschizophrenics (125). Another set of findings are the poorer scores on a 
visual reasoning factor, on a social conformity factor, and on an impulse control 
factor achieved by a group of Project Talent subjects at the time they took the tests 
in 1960, and who subsequently were hospitalized for mental disorder (8). 

The genetic model is spreading from the field of schizophrenia into other 
diagnostic categories such as depression (56), neurosis (231), and even criminality 
(274). The introduction of the study of adoptees for contrasting biological (genet- 
ic) and environmental (rearing) factors has demonstrated the strength with which 
the genetic component will make itself felt independently of rearing, but this 
victory of the geneticists has been gained at some sacrifice. The inheritance of 
schizophrenia, though undeniable, is no longer specific, but lies along a continuum 
spreading from genotypes whose phenotype is quite normal to those who are 
schizoid and to the more severely afflicted (114). One of the most interesting 
contributions of the introduction of the adoptee studies is the finding that the slow 
reaction time exhibited by schizophrenics might be due to rearing rather than to 
genetics.⁷ 

Genetics has also demonstrated its importance in the field of affective disorders. 
Although the earlier twin and family studies had stressed the importance of 
genetics in the transmission of manic-depressive psychosis, the mechanisms in- 
volved were not known until the recent findings concerning linkage with markers 
on the X chromosome—color blindness and Xg blood group—in the families of 
bipolar depressives. The most recent findings in the genetic area were reported at 
the recent symposium by the American Psychopathological Association (56). 

The internal environment model, which has been in the doldrums for a while, is 
again in the forefront of scientific promise (113). The early work on the internal 
environment model proved disappointing because it dealt with peripheral blood 
flow and metabolic correlates in the brain. Furthermore, it was focused on discov- 
ering some unique toxic substance not present in normals. It was not until central 
synaptic mechanisms became the focus of attention that progress became appar- 
ent. Furthermore, the search for uniquely novel substances gave way to searching 
for inappropriate interconnections or interactions between chemically normal 


7 J. L. Van Dyke, D. Rosenthal & P. V. Rasmussen. Schizophrenia: effects of inheritance 
and rearing on reaction time. Unpublished manuscript. 
components of the brain. Now at least three hypotheses seem promising. The first 
is the transmethylation hypothesis, which stipulates that the accumulation of 
methylated hallucinogenic substances underlies schizophrenic behavior. The se- 
cond hypothesis postulates that the disturbance in central catecholamine synapses 
underlies the vulnerability in schizophrenia and is based on the similarity to 
schizophrenia noted in the toxic psychosis induced by amphetamine. Apparently 
release of dopamine in the brain is the culprit. The third hypothesis postulates that 
monoamine oxidase which is markedly reduced in the platelets of schizophrenics 
may serve as a marker for schizophrenia (113). 

Can these various scientific models be integrated in some way? If we could 
squeeze out the essence of the six models into a goblet, we would find that the 
common element in all of these models is the production of some kind of vulner- 
ability in the mental patients. When they are subjected to sufficient stressful 
situations in life, this vulnerability produces an episode of illness. We know that 
depressive or manic episodes are time limited; that schizophrenic episodes are also 
time limited remains to be demonstrated, since some schizophrenics appear to 
remain ill indefinitely. From our prognostic literature we know that good pre- 
morbids improve while poor premorbids do not. If we make the assumption that 
both good and poor premorbids have time-limited episodes from which they 
recover, it is possible to explain the paradox. Both good and poor premorbids 
return to their premorbid level when the episode is finished. The good premorbids 
recover and return to their premorbid status in life, but the poor premorbids, even 
when the episode is over, still cannot cope. We are not even sure that the episode is 
finished; furthermore, because of their high vulnerability, they are catapulted into 
another episode soon after the first is ended. The problem facing clinical research 
is how to improve the poor premorbids so that they will be able to cope with life’s 
exigencies when an episode ends. Perhaps an investigation of good premorbids 
who fail and of poor premorbids who succeed may hold the key to the solution 
(293).8 
therapist attitudes towards 
poor, 522-23 
untreated mental disorders 
among poor, 569 
Classification 
and training 
in children, 99 
Cloze Test, 
see Tests and scales 
Cocaine 
incidence of use, 51 
see Drug use and abuse; 
Stimulants and depres- 
sants 
Cochlea 
in auditory evoked response, 
240 
in complex sound research, 
211-18 
Coding 
in memory 
phonemic and visual, 
297 
Cognition 
cognitive style 
individual differences in, 
106-7 
definition of, 173 
effects of language on 
in children's learning, 99 
literature on 
review of, 101 
in schizophrenics, 597 
Cognitive competency 
among young children, 111 
Cognitive dissonance 
test of theory, 35 
Cognitive flexibility 
as stable through adulthood, 
73 
Cognitive maps 
of space 
chimpanzee studies, 196 
Cognitive mistracking 
theory in perception, 269 
Cognitive processes 
in adult development and 
aging, 71-77 
Cognitive style 
imitation of 
among children, 113 
and level 
in high vs low redundancy 
schizophrenia, 602-4 
and personality 
study of, 80 
in schizophrenia assessment, 
599-615 
sex differences in 
in personality, 407-8 
in test validity 
for minority groups, 486 
College Characteristics Index 
(CCI) 
see Tests and scales 
College and University Envi- 
ronment Scale (CUES) 
see Tests and scales 
Color 
and SE response studies, 
235 
and form 
and concept identification 
research, 186 
in schizophrenic language 
studies, 613-14 
in visual perception research, 
269-76 
Combination tones 
psychophysiological origins 
of, 211 
Commitment 
in attitude studies, 415, 
437-38 
and involvement 
in attitude research, 436 
Communes 
sex-role stereotyping in, 
A ` 
Communication 
nonverbal 
biological role in, 405-7 
sex differences in 
among children, 120-21 
skills 
and role-taking, 110-12 
Communication problems 
in marital couples 
and behavioral analysis, 
628 
Communities 
age integrated 
studies on, 82 
Community psychology, 570- 
71 


Comparative studies 
of therapies, 510-12 
Compensation 
and conservation 
studies on, 101 
Competition 
and circadian rhythms 
in animal behavior, 141 
Complex sounds 
steady state 
timbre of, 216 
Compulsions 
and behavior therapy, 
526 
Computer analog models 
in thinking research, 174 
Concept attainment 
and problem solving 
in adult development, 74 
see also Thinking 
Concept formation 
imitation of 
among children, 113 
studies, 101 
Concept identification 
complete and rule learning 
paradigms, 185-86 
complex attribute identifica- 
tion 
models of, 182-86 
stimulus and encoding 
variables, 184 
critique of literature, 186- 
87 
focusing in 
role of, 182-84 
mathematical models of, 
179-82 
assumptions of focusing, 
179-80 
stochastic models of, 179- 
81 
strategies of, 182-84 
and thinking, 179-87 
transfer paradigms, 185 
trial outcomes and response 
categories, 184-85 


Concepts 
natural 
and concept identification 
research, 186-87 
Conditional reasoning 
experiments on, 190-91 
Conditioning 
cellular mechanisms of 
in the cat, 375-76 
classical 
behavioral analysis methods 
in, 625-27 
operant and respondent 
in behavioral analysis, 
623 
operant model 
and methods of behavioral 
analysis, 627-29 
theories 
in hypnosis, 22-23 
Consciousness 
altered 
effect on evoked responses, 
245-46 
expanding groups 
see Encounter groups 
Conservation 
and compensation 
studies, 101 
and training 
in children, 99 
Context theory 
in visual perception, 268 
Contiguity and similarity 
as fundamental determinants 
of memory, 292 
Contingent negative variation 
(CNV) 
interpretations of, 242 
in schizophrenic arousal, 
605 
see also Electrophysiology 
and behavior 
Coprophagy 
in rats 
and daily rhythms, 158 
Copulation 
cycles of 
influence of illumination on, 
161 
Corneal reflection 
in infants 
improvements for, 108-9 


Cornell JDK 
see Tests and scales 
Corticosterone 
rhythms 
entrainment of, 142 
in seasonal cycle studies 
in sparrows, 153 
Counseling, 337-50 
and client behavior 
process research in, 
348 
confrontation and self- 
exploration 
studies of effectiveness, 
342-43 
empathy 
and other therapist-offered 
conditions, 338-40 
evaluation of psychotherapy, 
345-47 
human helping skills 
training in, 343-44 
interaction between counsel- 
or and client 
study of, 338-39 
prevention of mental dis- 
orders 
role in, 560 
role-theory in, 340 
selection of counselors, 344- 
50 
criterion problem, 345- 
47 
relationship as intermedi- 
ate outcome, 347-50 
self-disclosure 
studies of effectiveness, 
340-42 
and student development, 
337-61 
therapeutic systems 
diversity of opinion, 348 
therapist training theories, 
347-50 
training innovators 
listing of, 343 
Covert reinforcement 
as behavior therapy, 537- 
38 
Creative functioning 
and masculinity-femininity, 


Creativity 
and cognitive style 
study of, 80 
and hypnosis 
study of, 24 
and hypnotic responsiveness, 
27-28 
and personality 
and Jungian thought, 397 
sex differences in, 104-6 
Crime 
as drug addiction indicator, 
47 
Criminality 
genetic model of, 659 


Crisis intervention 
in preventing mental dis- 
orders, 573, 575 
Cross-cultural studies 
of balance theory, 433 
of mother-infant interac- 
tions, 121-22 
on perception of pictorial 
materials, 272 
of personality 
proposal for, 397 
in Ponzo effect, 268-69 
psychopathology assess- 
ments, 636-41 
in reasoning, 192 
Cross-national studies 
diagnosis 
in mental disorders, 637 
differences in depression 
diagnosis, 633 
of sex-role socialization, 
402-3 
Cross-racial bias 
in facial recognition, 279 
Cross-sectional studies 
of adult personality struc- 
ture, 81-82 
of student development, 351- 
53 
Cryptarithmetic 
and problem solving process- 
es, 177
Cultural differences 
in color naming 
theory of, 269-70 
Cultural factors 
and failure of children, 
100 
in family interaction, 121- 
22 
in prosocial behavior, 116 
Culture 
effects on mental disorder 
assessment, 640-41 
and emotional expression, 
408 
and personality 
and psychoanalytic theory, 
397 
subjective 
and attitude research, 
416 
Cyclazocine 
in addiction treatment, 59 
Cyclic adenosine monophos- 
phate (cAMP) 
neuronal plasticity, 381-82 


D 


Dating couples 
and fear of success 
studies of, 7 
Deaf
language and cognition in 
relation of, 100 
language studies 
verbal mediation, 194
Death 
attitudes of aged toward, 
83 


cerebral 
EEG and ER as measures 
of, 246 
prediction of 
in adult development studies,
70-71 
Deception and suspicion 
as methodological issues 
in attitude experiments, 
420-21 
Decision-making 
and dissonance studies, 
425 
Decrement models 
in adult development studies, 
66-68 
Defensiveness 
in personnel attitudes
and motivation, 469 
Depression 
behavioral analysis of, 543, 
625 
characteristic symptoms of, 
629 
classification of, 632-36 
neurotic-psychotic dichot- 
omy, 633-35 
numerical typologies of, 
635-36 
reliability of diagnosis, 
632-33 
criteria for diagnosis 
table of, 646-47 
cross-racial assessment of, 
638-39 
and fatigue 
desensitization in treating, 
528 
genetic model of, 659 
measurement of, 630-32 
rater perception, 632 
rating scales, 630-31 
sleep disturbance in, 631-
32 
measurement and classifica- 
tion of, 629-36 
Desensitization 
behavior therapy 
role in, 527-31 
compared with flooding, 
35 
compared with other therapies 
in Di Loreto study, 511 
in counselor training, 344 
definition of, 527 
reviews applying to, 539 
Deterioration vs improvement 
in various therapies, 517- 
Determinism 
in expectancy theory, 461- 
62 
Deuterium oxide 
effects on biological rhythms, 
148-49
Developmental change
correlational methods in 
study of, 70 
Developmental psychology, 
97-126 
and aging 
studies on, 71 
cognition, 101 
intellect creativity and 
achievement 
cognitive style, 106-7 
individual differences in, 
101-7 
sex differences in, 104- 
6 
introduction, 97 
language, 99-101
learning, 98-99 
logic and thought
problem solving, 187-88 
memory 
children's acquisition 
strategies, 107-8 
experimenter interventions, 
108 
semantic integration, 108 
studies of, 107-8 
perceptual-motor develop- 
ment, 108-10 
motoric-graphic skills, 
109 
spatial reference systems, 
109-10 
race, social class, and intelli-
gence
as achievement factors, 
101-4 
role-taking and communica-
tion skills, 110-12
methodological advances, 
112 
theoretical advances, 110- 
12 
social and personality devel- 
opment, 112-25 
family interaction, 121- 
23 
family structure, 123-25 
modeling, 112-14 
moral development and 
self-control, 114-19 
racial awareness and 
attitudes, 119-20 
sex differences in social 
development, 120-21 
Dichotic listening 
in adult memory studies, 
76 


Di Loreto study 

comparing different psycho- 

therapies, 511 

Diphenylhydantoin (Dilantin) 

effects on CNS activity, 247 
Diplacusis 

binaural, 213-14 
Discrimination 


of complex sounds, 212 
Disengagement theory 
and adult personality, 79- 
80 
Dishabituation 
plastic changes underlying, 
372-73 
Dissonance 
in attitude research, 421- 
25 
definition and occurrence,
424 
in therapist observations, 
542 
Distance 
in perception studies, 270- 
72 
Distraction 
importance in children's 
learning, 98-99 
Dogmatism 
as personality variable 
in attitude study, 438-39 
Dopamine (DA) 
effects on circadian rhythms, 
149 
in toxic psychosis, 660 
Down's syndrome 
and evoked response 
research, 247 
as organic mental disorder, 
564 
Draw-a-Man test 
see Tests and scales 
Dreaming 
evolutionary adaptedness of 
and personality, 396 
Dreams 
and personality, 410 
see also Sleep 
Drinking 
excessive 
behavioral analysis of, 
543 
Drug addiction 
prevention of, 561 
Drug Education Programs 
ineffectiveness of, 57-58 
Drug effects 
on behavior 
and biological rhythms, 
150-51 
on CNS activity, 247-48 
on evoked responses, 236 
on patient test performances,
652 
Drug use and abuse, 45-60 
etiology and methods of 
spread, 55-56 
introduction, 45 
marijuana and hallucinogens 
effects of, 56-57 
methadone treatment, 48 
among military personnel 
studies of, 51-53, 55 
nonmedical 
reviews of, 45 


nonmedical users 
surveys of, 48 
patterns of use 
frequency, amount, and 
duration, 51-55 
hallucinogens, 53-54 
marijuana, 52-53 
opiates, 51-52 
stimulants and depres- 
sants, 54-55 
polydrug abuse, 50-51, 54, 
60 
prevalence and trends, 47- 
51 
hallucinogens, 49-50 
marijuana, 49 
opiates, 47-49 
stimulants and depres- 
sants, 50-51 
prevention, 57-58 
psychedelic drugs 
and heroin addiction, 49 
recent epidemic, 46 
socioeconomic aspects of, 
55-56 
treatment, 58-60 
Dual-process theory 
limited capacity of STS 
and short-term forgetting, 
304-7 
in memory, 295-308 
Dutch infants 
in family interaction studies, 
121 


E 


Ear 
outer 
role in sound localization, 
220 
Eating problems 
aversive conditioning in, 
531 
Ecological variables 
in mental disorders, 571- 
73 
Education 
and age differences on 
Piagetian tasks, 74 
in prevention of mental dis- 
orders, 575 
stereotypic thinking in 
schools, 4 
Edwards Personal Preference 
Schedule 
see Tests and scales 
EEG 
age differences in, 78-79 
hypnotic responsiveness 
correlates of, 28 
and imagery tasks, 195 
in schizophrenic arousal, 
605-6 
sleep disturbance in depres- 
sion, 632 
Ego 


development 
in personality research, 
399-401 
strength and defense 
in personality, 410 
Egocentrism 
and developmental theory, 
110-11 
Electric shocks 
as modifying hypnotic sus- 
ceptibility, 27 
Electrophysiology and behav- 
ior, 233-48 
auditory evoked response 
(AER), 240-41, 243, 245- 
47 


contingent negative variation 
(CNV), 242 
evoked responses 
and animal learning, 247- 
48 
effects of sleep anesthesia, 
245-46
methods and techniques, 
235 
reliability, stability, validity,
234-35 
taste and smell stimuli, 
241-42 
extralemniscal hypothesis 
evidence against, 237-38 
information delivery 
stimulus uncertainty— P3 
wave, 244-45 
information processing, 243- 
48 


attention, habituation, 
selective attention, 243- 
44 


intelligence and mental retar- 


dation, 246-47 
introduction, 233-34 
neurogenesis of the evoked 

response, 236-38 
psychological variables 

and contingent negative 

variation, 245 
sensory receiving, 238-42 
somatosensory evoked 

responses (SERs), 241, 

243, 245-47 
statistical and mathematical 

analyses, 235-36 
visual evoked responses 

(VERs), 234-35, 238-40, 

243, 245-47 
see also Evoked responses; 

Contingent negative varia- 

tion 

Electrosleep therapy 
evaluation of, 516 
Elgin scale 
see Tests and scales 


EMG 
correlate of speech and 
thought, 195 


feedback 


in modifying hypnotic 
susceptibility, 27 
of schizophrenics, 651 
of tongue during mental 
arithmetic, 193 
Emotion 
and motivation 
and attitude research, 
416 
and personality, 408-10 
universal facial expressions 
of, 408 
Emotional development 
in girls 
effect of father absence 
on, 124 
Emotional expression 
biological role in, 405-7 
Employment 
equal opportunity 
review of court cases, 
486-87 
see also Personnel attitudes 
Encoding 
in memory 
characteristics of, 293 
dimensions of, 313-15
variability and specificity 
of, 315-22 
variables in concept identi- 
fication, 184 
Encounter groups 
deterioration among partici- 
pants, 517-18 
as modifiers of hypnotic 
susceptibility, 27 
as psychotherapy substitute 
for women, 9 
Environment and heritability 
in old age behavior, 67 
Environmental complexity 
cerebral effects of, 385 
Environmental psychology 
and behavioral analysis, 
625 
Enzymes 
synchronization of activity 
and circadian rhythms, 
141 
Epidemiology 
of mental disorders, 567- 
73 
psychiatric, 636-43 
see also Psychiatric epide- 
miology 
Epileptic seizures 
reduction of 
by training, 248 
Epinephrine 
role in synaptic plasticity, 
381 


substitution of hypnosis for, 
35 
Escape response 
habituation of 
neurophysiology of, 370 
Estivation 
as seasonal adaptation,
151, 155 
Estrogens 
role in sexual receptivity 
in rodents, 160 
Estrous cycle 
and biological clock in 
hamsters, 160 
and biological rhythms, 
147-48 
Ethical behavior 
and reasoning, 114 
Ethnic 
influences on children's 
language, 99-100 
Ethnic differences 
in job performance predic- 
tion 
in personnel testing, 488- 
89 


Ethyl alcohol 
effects on biological rhythms,


Euthanasia 
attitudes of aged toward, 
83 


Evoked response 
and animal learning, 247- 


and attention, habituation, 
and selective attention, 
243 
effects of selective attention 
on, 244 
intelligence and mental re- 
tardation, 246-47 
in schizophrenic arousal, 
605 
see also Electrophysiology 
and behavior 
Evolution 
and female hypersexuality, 
11 
Exhibitionism 
aversive conditioning in, 
531-32 
Expectancy theory (V-I-E) 
critique of, 459-65 
hedonism, 459-62 
infinite regress problem, 
464-65 
irrational behaviors, 464 
ratio scale problem, 464 
search-load problem, 463 
subconscious motivation, 
463-64 
time span, 463 
and fear of success
in women, 6 
and goal theory 
relationship between, 467- 
68 
and personnel attitudes and 
motivation, 458-65 
Experiment 
social psychology of 
methodological issues, 


419-20
Extroversion 
and personality 
role of biology in, 403-5 
Eye contact 
and personality 
biological role in, 405-7 
Eysenck Personality Inventory 
see Tests and scales 


F 


Facial expressions 
and emotion, 408-9 
Factor-analytic studies
of adult personality in age, 
81 


Family 
guidance 
role in preventing mental 
disorders, 560 
interaction 
in social and personality 
development, 121-23 
programs 
preventing mental disorders, 
575 
of schizophrenic 
role of, 659 
sex-role socialization 
role in, 2, 402 
Family structure 
in social and personality 
development 
birth order, 124-25 
fatherless families, 123- 
24 
parent-child interaction, 
122-23 
parent-infant interaction, 
121-22 
Fantasy 
in behavior therapy, 537 
goal-directed 
in hypnosis, 23 
involvement 
and hypnosis, 19-20 
Fatherhood 
and sex roles, 2
Fear of success (FOS) 
in women, 6-7 
Fear Survey Schedule
see Tests and scales 
Federal Service Entrance 
Examination 
see Tests and scales 
Feedback 
as behavior therapy, 539 
Female psychosexual develop- 
ment 
biosocial theory of, 2 
Freudian theory of 
criticism of, 8-13 
Field dependence 
related to disengagement 
and personality, 80 
and Rod-and-frame test 
measured by, 274-75
Finances 
effect of 
on college attendance, 359- 
60
Flexion reflex 
neurophysiology of, 368- 
69 
Flooding or implosion 
compared with desensitiza- 
tion, 534-35 
reviews of, 539 
Focusing 
role in concept attainment 
and problem solving, 182-
84
Follicle stimulating hormone 
(FSH) 
in seasonal cycle studies, 
153, 155 
Free recall 
in memory, 300-1, 314, 
316-18 
Frequency 
distributions
of depressed patients, 635 
modulation 
auditory pathways for, 
212 
Freud 
criticism of sexual theory, 
8-13 
Frigidity 
treatment of, 9-10 
Functional age 
in adult development studies, 
70 


G 


Galvanic skin response 
age differences in, 78 
in schizophrenic arousal, 
605 
Gambling 
and aging 
study of, 82 
General Aptitude Test Battery 
see Tests and scales 
Generational change 
in adult development studies, 
72-73
Genetics 
in aggression, 403 
counseling 
in mental disorder preven- 
tion, 560 
effects 
of mental disorders, 562 
factors
in schizophrenia transmis- 
sion, 601-2 
and intelligence levels, 102- 


3 
Gestalt theory 
in low pitch theory, 
215 


in thinking research, 173- 
75 


Goal setting 
and management by objectives 
in personnel research, 
465-68 
Goal theory 
and expectancy theory 
relationship between, 467- 
68 
Gonadotropins 
cyclic release of 
in rats, 160 
Gordon Personal Profile and 
Personal Inventory 
see Tests and scales 
Graduate Record Examination 
(GRE) 
see Tests and scales 
Graeco-Latin Square 
in problem solving, 178 
Graphic skills 
developmental studies of,
109 
Group 
pressures on employees, 
475 
social psychology of 
and attitude research, 
416 
Group Therapist Orientation 
Scale 
see Tests and scales 
Guatemalan children 
aggression and popularity, 
118 


H 


Habituation 
in infants 
methodological improve- 
ments in, 109 
in information processing, 
243-44 
invertebrate studies, 369- 


74 
behavioral, 370 
locus of plastic changes, 
372-73 
neural circuits and escape 
responses, 371-72 
peripheral mechanisms, 
370-71 
as neurophysiological func- 
tion, 368-74 
in schizophrenics, 605 
synaptic change during 
mechanism of, 373-74 
vertebrate studies, 368-69 
Hallucinations 
aversive conditioning in, 
531 
as hypnotic phenomena, 31- 
32 


psychotic auditory 
and behavior therapy, 528 


Hallucinogens 
effects of use, 56-57 
and hippie movement, 46 
patterns of use 
frequency, amount, and 
duration, 53-54 
prevalence of use, 49-50 
see Drug use and abuse 
Halstead-Reitan Test Battery 
see Tests and scales 
HALMA 
problem solving game, 
177 
Handedness 
see Brain 
Handwashing compulsions 
desensitization in treating,
528 
Harmonics 
frequencies of 
codes for, 214 
presentation of 
dichotic vs monotic, 211- 
12 
in spatial frequency, 276 
Harvard Student Study 
of personality development 
longitudinal, 355-56 
Hashish 
chronic users, 53 
see also Marijuana 
Headache 
tension 
treatment by relaxation, 
539 
Health 
in age difference studies, 
82 
Health-Sickness Rating Scale 
see Tests and scales 
Hearing 
binaural 


reviews on research, 219 


Heart rate 
differences 
in schizophrenic arousal, 
605 
variations in 
as aging factor, 79 
Heart rate studies 
relevance to redundancy 
formulation 
in schizophrenia, 611-13 
Heart and stress symptoms 
treatment through hypnosis, 
34 


Hedonism 


as flaw in expectancy theory, 


459-62 
Helping behavior 
among preschool children, 
410 


and reactance 
in attitude studies, 436 
and stereotypic sex roles, 
4 
Hemispheric laterality 


Hippies 


and hypnosis, 20 
Hepatitis 
as drug addiction indicator, 
47 
Heritability
of biological clocks, 16 
and environment 
in old age behavior, 67 
Heroin 
consumption in mg/day, 
51 


spread as contagious disease 


model, 55 
trends in use, 47-49 
see also Opiates 
Herzberg theory 
and job enrichment, 468- 
71 


Hibernation 
as biological rhythm, 137 
as circannual rhythm 
in various species, 154 
as seasonal adaptation, 151 
Hiccups 
treatment through hypnosis, 
34 


Hippie movement 
relation to drug epidemic, 
46 


study of 
in ego development, 399 
Hippocampus 
synapses in 
and neuronal plasticity, 
380 
Homographs 
studies of encoding using, 
317, 321
Homosexuality 
behavioral therapy in, 528, 
538-40, 542 
Honesty instructions 
in hypnosis, 21-22 
Hormones 
and seasonal behavior 
relationship to, 153, 155 
synergistic actions of 
and circadian rhythms, 
141 
see also specific hormones 
Human 
circadian rhythms 
research on, 138 
values 
and attitude research, 
415 


Humanism 


emotion and personality, 
409 
in personality theory, 398 
Humanistic model 
in personnel assessment,
498 
5-Hydroxytryptamine (5-HT) 


effects on circadian rhythms, 


149 
Hyperactive behavior


and achievement striving, 
105 


Hyperoxygenation 


effects on memory and 
learning, 77 


Hypnosis, 19-36 


as adaptive regression, 23- 
24 

of animals, 33 

archaic involvement, 24 

cautions in use of, 35 

clinical applications of, 32- 
34 


conditioning theory of, 22- 
23 
contemporary history of, 
35-36 
and creativity, 24 
effect on evoked responses, 
245-46 
fantasy and imagination 
role in, 19 
future problems, 36 
historical reviews of, 36 
hypnotic responsiveness 
personality correlates of, 
27-28
scales, 25-26 
the hypnotizable person, 25- 
28 
introduction, 19-20 
as investigating tool, 34- 
35 
measured differences in 
subjects 
stability of, 26 
monitoring and controlling
subjective states, 35 
multiple-aspect theories 
of, 24 
neodissociation theory, 19, 
24-25
hidden observer in, 24 
neutral 
definition of, 29 
operational theory of, 23 
original task motivation 
instructions (OTMI) 
in support of nonstate 
theory, 21-22 
pain reduction studies, 29- 
30 
plateau responsiveness, 26 
posthypnotic amnesia 
in simulators, 22 
posthypnotic suggestion, 23 
psychoanalytic theory in, 
23-24 
psychophysiological corre- 
lates of 
unilateral control of skin 
temperature, 30 
real-simulator design 
in support of trance con- 
cept, 22 
research literature 
major reviews of, 36
revised task motivation 
instructions (RTMI), 22 
role theory of, 23 
standardized scales 
supplements to, 26 
susceptibility 
hereditary factor in, 26 
modification of, 27 
symptom control 
in clinical practice, 33 
theories of, 22-25 
trance controversy, 20-22 
trance depth, 24 
as treatment for frigidity, 
9 
Hypnotic phenomena 
amnesia, 30-31 
illusions and hallucinations, 
31-32 
sleep and dreams, 32 
Hypnotic responsiveness 
EEG correlates of, 28 
personality correlates of, 
27-28 
psychophysiological corre- 
lates of, 28-30 
laterality of function, 28- 
29 


scales 
see Tests and scales 
Hypnotic susceptibility 
and age differences, 82 
Hypothalamic-pituitary axis
effects of steroids on, 160 
Hypothalamus 
role in circadian rhythms, 
147 
and visual pathways, 143- 
44 
Hysterical blindness 
and behavior therapy, 526 


I 
Illusions 
developmental studies of, 
109 


as hypnotic phenomena, 31- 
32 


moon, 270 
Müller-Lyer illusion 
in schizophrenia, 600-1 
pattern anomalies, 267-69 
Poggendorff effect, 267-68 
Ponzo effect, 31, 268-69, 
278 
Pulfrich stereophenomenon, 
271 
and sensation 
in adult development, 77- 
79
theory, 269 
twisted-cord effect, 267 
visual 
in adult development stud- 
ies, 69 


age difference studies, 
78 
Imagery 
in anxiety arousal, 535-36 
in behavior therapy, 530 
in learning and retention, 
99, 322-23 
measures 
see Tests and scales 
role in thinking and concept 
attainment, 194-96 
Imagination 
and fantasy 
in personality, 410 
and motor activity 
in young children, 112 
and play 
in personality development, 
410
role of right hemisphere 
in, 20 
Imitation 
and language behavior 
in children, 99 
see also Modeling behavior 
Immigrants 
and mental disorder, 570 
Incentives 
monetary 
and personnel attitudes, 
471-73 
and rewards 
in attitude studies, 430-
31 
Incontinence 
and behavior therapy, 526 
Incubation 
in problem solving, 175-76 
Indians 
Blackfoot children 
and prosocial behavior, 
116 
Individual differences 
in cognitive style, 106-7 
developmental analysis of, 
70 
in fantasy involvement, 19- 
20 
in intellectual development 
and achievement, 101-7 
in operational theory, 23 
psychophysiological and 
genetic bases of, 403-4 
Individual variations 
in gaining logical inference, 
191 
Inequity theory 
in dissonance studies, 
425 
Infantile sexuality 
criticism of, 10-11 
Infants 
cognitive development stud- 
ies, 105 
evoked response studies on, 
234-35 
in family interaction studies, 


121-22 
form perception, 266 
language abilities, 99-101 
motor development in 
and evoked response 
research, 246 
ocular defects in 
and perception, 276 
perceptual-motor develop- 
ment in, 108-9 
and visual evoked response 
studies, 240 
Influence 
attitudes toward 
research on, 430 
Inhibition paradigm 
retroactive
age differences with, 76 
Insomnia 
treatment through hypnosis, 
34
Intellectual 
ability and achievement 
individual differences in, 
101-7 
development 
in adulthood, 72 
and family interaction, 
122-23
sex differences in, 104-6 
performance 
in adult development stud- 
ies, 73
of deaf children, 100 
styles 
racial differences in, 410 
Intelligence 
age decrement in 
studies of, 68-69 
and evoked response 
research, 246-47 
as factor in intellectual 
development and achieve- 
ment, 101-4
stable through adulthood,
73 
tests 
cultural bias in, 102-3 
dispute over use, 102 
Interaction 
in personality research, 
395-96 
Intergenerational differences 
in adult development studies, 
69-71 
psychological aspects of, 
70 




International Personnel Man- 
agement Association 
series 

see Tests and scales 
Interpersonal 
attraction
and attitudes, 416, 441-42 
inference 
sequential model for, 110- 
12 


Interview 
structured 
in biometric approach to 
psychopathology, 622, 
630, 641, 648, 656 
Intimacy vs isolation crisis 
in college men 
study of, 399 
Introversion-extroversion 
see Personality 
Involvement 
and social judgment theory 
in attitude research, 435- 
36 
IQ 
and creativity 
in children, 104 
tests 
conflict over use, 102 
see also Tests and scales 


J


Job analysis 
and personnel testing, 483- 
84 
Job-sex patterns 
cross-cultural, 4 
Jung 
rediscovery of 
in personality research, 
397-98 
word association test 
see Tests and scales 
Junior Professional Assistant 
exam 
see Tests and scales 
Jury decisions 
studies of 
and interpersonal attrac- 
tion, 441 
Juvenile delinquency 
prevention of, 561 


K 


Kappa coefficient 
statistic correcting for 
chance, 633, 648 
table of, 647 
Kelly Rep Test 
see Tests and scales 
Kleptomania 
and behavior therapy, 528 


L 


Language 
behavior and imitation 
in children, 99 
and cognition 
in deaf children, 100 
effects of 
on mental disorder assess- 
ment, 640-41 
as encoding feature in mem- 
ory, 313 


literature on 
review of, 99-101 
schizophrenic, 613-14 
Sino-Tibetan 
conditioning in, 435 
styles 
sex-role stereotyping in, 
4 
and thought, 192-96 
imagery, 194-96 
inner speech, 193 
private speech, 193-94 
in understanding logic, 
189-90 
verbal mediation, 194 


Lateral suppression


and continuity effect, 216- 
17 


Latin square 
in problem solving, 178 
Learning 
in adult development studies, 
75 
animal 
and evoked response, 247- 
48 
biological rhythms in, 162 
and conditioning of attitudes, 
434-35 
use of hypnosis in, 31, 34- 
35 


imagery in 
role of, 322-23 
literature on 
review of, 97-99 
neurophysiology of, 367-85 
redundant cues 
effects of on, 184 
and social behavior 
in attitude research, 
416 
synaptic plasticity as mech- 
anism of, 378-85 
as theory of hypnosis, 22- 
23 


see also Neurophysiology of 
learning 
Light 
and circadian rhythms, 139- 
44 
entrainment by 
physiological basis for, 
141-42 
responses to 
neurophysiology of, 371, 
384 
Light stimulation
and morphological changes 
in synapses, 384 
Linguistic 
deficiency 
ethnic influences in child- 
ren, 100 
Linguistic context 
of analytic-global style 
in personality, 407 
see also Language 
Literature
children's 
sex-role stereotypes in, 
4 
Lithium 
in schizophrenia treatment, 
602 
Locomotor activity 
influenced by colored light, 
143 
in seasonal cycles, 153 
Locus of control 
as personality theory, 396 
Logic 
symbolic 
and problem solving pro- 
cesses, 177 
and thought, 187-92 
conditional reasoning, 190- 
91 
formal operations, 191 
generalization and abstrac- 
tion, 191-92 
logical symbols and terms, 
189-90 
syllogistic reasoning, 189 
transitive inference, 187- 
89


Logical


decision tree 
computer diagnosis model, 
644-45 
inference 
age of development, 191 
terms 
comprehension of, 189- 
90 


Longevity 
psychological concomitants 
of 
studies of, 71 
Longitudinal studies 
of adult personality develop- 
ment, 81 
on cohorts of births 
and mental development, 
642 
Fels Institute data 
IQ sex differences and age,
105 
of sex-role socialization, 
402-3 
of student development, 351, 
353, 355 
of worker adjustment 
to 4-day week, 475 
Loudness
experiments on, 215-16 
Love 
and attraction 
theories of, 442 
Loxapine 
efficacy of 
International Collaborative 
Study, 641 
LSD 
freakout 
treatment by relaxation
training, 539 
psychological effects of, 57 
see also Drug use and abuse; 
Hallucinogens 
Luteinizing hormone (LH) 
in seasonal cycle studies, 
153, 155, 160 


M 


Malan studies 
of psychoanalytical therapy, 
514 
Management by objectives 
(MBO) 
and personnel attitudes 
and motivation, 465-68 
Manic depressive disorders 
etiology of 
and biological milieu, 643 
and industrial societies, 571 
inheritance of, 659 
prevention of, 561 
vs unipolar depression, 629 
Marijuana 
effects of use, 56-57 
patterns of use 
cross-cultural comparisons, 
52-53 
frequency, amount, and 
duration, 52-53 
surveys on use, 48-49 
use of 
social and psychological 
correlates of, 55-56 
users 
characteristics of, 56 
see also Drug use and abuse 
Marital therapy 
deterioration in, 518 
Marriage 
and college attendance 
effects of on, 359 
counseling 
in mental disorder preven- 
tion, 560 
sex roles in, 2 
Masculinity-femininity 
conceptions of, 2 
evolution of, 11-12 
measures of, 3-4 
review of literature on, 3 
sex differences in personal- 
ity, 401-3 
Masking 
in auditory experiments, 
210, 216-18, 221-22 
lateral 
in studies of reading, 273- 
74 
Maternal behavior 
and child's cognitive style, 
107 
in circadian rhythms, 145 
Mathematical 
problem solving, 176-77 


reasoning, 192 
Maudsley Personality Inven- 
tory 
see Tests and scales 
Measurement 
and scale construction 
in attitude research, 417 
Mediational literature 
on children's learning 
reexamination of, 98-99 
Melancholia 
involutional 
as unipolar depressive 
disorder, 632-33 
Memory 


in adult development studies, 


75-77 
attributes of, 325-27 
frequency discrimination 
and recognition, 326 
repetition and representa- 
tion of events, 325 
auditory presentation in, 
298-99 
biological rhythms in, 162 
in concept 
attainment, 182 
identification, 179 
cAMP 
role of in, 381 
decay and displacement, 
304-6 
dual-process theory, 295- 
308 
capacity of STS and short- 
term forgetting, 304-7 
coding in STS, 296-300 
critical considerations, 
307-8 
formation of, 295-96 
long-term and short-term 
systems, 295-96 
rehearsal and transfer to 
LTS, 300-3 
encoding 
specificity, 319-22 
syntactic features of, 313 
variability, 315-18 
encoding dimensions 
phonological attributes, 
313-14 
word, 313-15 
free-recall experiments, 
300-1, 314, 316-18, 324 
historical trends, 291-95 
hypnosis 
use of in investigating, 34- 
35 
imagery in, 322-23 
interference theory and 
forgetting, 327 
literature on 
review of, 107-8 
maintenance rehearsal in,
296, 303, 309 
mnemonic organization in 
STS, 296 


negative 
recency effect, 301-2 
transfer, 324 
omissions from the review, 
327 
and operative thought, 195 
organization of, 323-25 
phonemic encoding, 297-98 
proactive inhibition (PI) 
mechanisms of, 306-7 
release from, 313-14 
in problem solving 
role of, 178 
processing levels, 309-13 
and incidental memory, 
311-13 
recall
as retrieval and recogni- 
tion, 319-22 
recency peak in, 304-5 
semantic-associative pro- 
cessing, 309-12, 314-15, 
321 
semantic encoding 
debate over, 299-300 
storage of whole ideas, 314- 
15 
transfer 
and interference problems, 
316 
studies, 318 
and verbal learning, 291- 
327 
Men 
role change 
implications of, 2 
Menninger Study 
of psychoanalysis 
long term, 512-14 
Mental deficiency 
prevention of, 561 
Mental disorders 
assessment 
across racial boundaries, 
638-40 
effects of language on, 640- 
41
use of case registers in, 
641-43 
behavioral analysis of, 625 
classifications of, 561-67 
computer diagnosis of, 643- 
45 
models of, 656-60 
organic causes, 564 
primary prevention of, 557- 
78 


disturbance models, 560- 


epidemiology and ecology, 
567-73 

introduction, 557-60 

specific programs, 575- 
76 

summary and conclusions, 
576-78 

vulnerability in, 660 


Mental hygiene programs 
role in prevention of mental 
disorders, 560 
Mental retardation 
and behavior therapy, 540 
and evoked response 
research, 246-47 
Mescaline 
see Drug use and abuse;
Hallucinogens 
Mesmerism 
early documents on, 36 
Methadone maintenance treat- 
ment 
of narcotics addicts, 49, 
58-59 
Methamphetamine 
see Stimulants and depres- 
sants 
Methaqualone 
see Stimulants and depres- 
sants 
Mexican-Americans
and achievement testing, 
103 
children in prosocial behav- 
ior study, 116 
Michigan State Affective Sen- 
sitivity Scale 
see Tests and scales 
Migraine headache 
treatment through hypnosis, 
33 


Migration 
as biological rhythm, 137 
as seasonal adaptation, 151 
Mills 
game 
problem solving analysis 
of, 177 


Minnesota Multiphasic Person- 


ality Inventory (MMPI) 
see Tests and scales 
Minority groups 
college students, 359-60 
drug addiction among, 49 
prevention of mental dis- 
orders among, 575 
Mitochondria 
affinity for calcium 
and synaptic plasticity, 
380, 382 
Modeling 
in aggression 
importance of, 117-18 
and altering ethical judg- 
ments 
in children, 115-16 
in altruistic behavior 
importance of, 117 
applications of 
reviews of, 540 
behavior 
as modifying hypnotic sus- 
ceptibility, 27 
therapy in, 536-37 
in social and personality 


development, 112-14 
Models 
of complex attribute identi- 
fication 
in concept identification, 
182-86 
computer analog 
in thinking research, 174 
for computer diagnosis of 
mental disorders, 644-45 
of concept attainment, 179- 
82 
of low pitch extraction, 214- 
15 
of mental disturbance, 560- 
67 
of photoperiodic time mea-
surement
in seasonal cycles, 151- 
54
of problem solving process- 
es, 177-78 
of psychopathology, 656-60 
Modus ponens 
discussion of, 190 
Monkeys 
mental disorders among, 
566 
neurosis among, 516 
Moral conduct 
models of, 399-401 
Moral development 
and adult personality, 80 
in children, 114-19 
and family interaction 
studies of, 122-23 
in personality research, 
399-401 
and self-control 
aggression, 117-18 
moral judgments, 115-16 
prosocial behavior, 116- 
17 
punishment, 118-19 
Moral judgments 
among delinquents 
and modeling behavior, 
536 
Morphological changes
in neuronal plasticity, 383- 
84 
Morphology 
and personality 
in obesity, 405 
Motherhood 
and sex roles, 2 
Mothering 
importance of 
in preventing mental dis- 
orders, 563 
lack of, and pathological 
behavior in monkeys, 
566 
Mothers 
role in self-control develop- 
ment, 119 
Motivation 
and attributions to others,
432 
behavioral periodicity in, 
138 
and emotion 
and attitude research, 416 
and personnel attitudes, 457- 
76 
positive and negative 
in expectancy theory, 461 
sex-differences in 
nursery age children, 105 
subconscious 
in expectancy theory cri- 
tique, 463-64 
Motivators 
and hygienes
in Herzberg theory, 468- 
71 
Motor activity 
and imagination 
in young children, 112 
and speech 
in children, 101 
Motoric-graphic skills 
developmental studies, 109 
Müller-Lyer display 
see Tests and scales 
Multidimensional scaling 
in attitude research, 415 
Muscular relaxation 
as modifying hypnotic sus- 
ceptibility, 27 
Mutism 
and behavior therapy, 526 
Myers-Briggs Type Indicator 
see Tests and scales 
Myopia 
improved in hypnosis, 31- 
32 


Myth 
role in stereotyping, 13 


N 


Naltrexone 
in addiction treatment, 59 
Naloxone
as opiate antagonist, 59 
National Merit Scholarship 
Qualifying Test 
see Tests and scales 
National Teachers Examination 
see Tests and scales 
Navy Maze test 
see Tests and scales 
Navy Vocational Interest 
Inventory 
see Tests and scales 
Neodissociation theory 
of hypnosis, 19, 24-25 
Nervous typologies 
genetic role in, 403-4 
Neural control 
in circadian rhythms, 145- 
48 


Neural organization 
opponent process model of
in duality of human nature, 
405-6 
Neurogenesis 
of the evoked response, 236- 
38 
Neuronal plasticity 
model studies of, 377-85 
in protozoa 
studies of, 378 
synaptic 
calcium and increased 
transmitter release, 379- 
81 
cAMP, 381-82 
decrement and transmitter 
depletion, 380-81 
morphological changes, 
383-84 
quantal analysis, 378-79 
theoretical models, 382 
trophic interactions, 382- 


Neurophysiological theories 
in perception, 278 
Neurophysiology of learning, 
367-85 
associative learning, 374- 
77 
habituation, 368-74 
introduction, 367-68 
neuronal plasticity 
model studies of, 377-85 
protozoa, 378 
synaptic, 378-85 
Neurosis 
behavior theory 
in treatment of, 625 
existential, 572 
prevention of, 561 
genetic model of, 659 
in monkeys, 516 
NIM 
problem solving game, 177 
Noise 
as punishment 
in self-control studies, 
118-19 
Nonverbal communication 
and personality, 405-7 
Noradrenalin 
synthesis of 
and calcium, 380 
Norepinephrine (NE) 
effects on 
circadian rhythms, 149 
food intake, 150 
Norms 
social and personal 
and attitude research, 418 
Null-match techniques 
in visual perception, 264- 
65 
Nurturance 
effect on imitation 
in children, 114 
Nutritional factors
in mental disorders, 562


O 


Obesity 
behavior therapy in, 526, 
540 
and personality 
role of biology in, 405 
Obsessions 
aversive conditioning in, 
531 
behavioral analysis of, 543 
Occupational choice 
and stereotypic sex roles, 
4 
Occupational development 
in student studies, 356 
O'Connor Finger Dexterity 
Test 
see Tests and scales 
Operant conditioning 
see Behavior modification 
Operational theory 
of hypnosis, 23 
Operational thought 
and imagery, 194-96
Opiates 
etiology and methods of
spread, 55 
patterns of use 
frequency amount and 
duration, 51-52 
see also Drug use and abuse 
Optic tracts 
primary and accessory 
and circadian rhythms, 
143-44 
Orgasm 
importance of 
in Freudian theory, 11 
Oscillators 
compared to circadian 
clocks, 139-41 


P 


Pain
reduction
through acupuncture, 33 
hypnotic experiments on, 
20, 23, 33-34 
and production, 29-30 
treatment by feedback tech- 
nique, 539 
Parallelism 
psychological 
and perception, 266 
Paranoia 
comparative hospital stud- 
ies, 603-4 
in schizophrenia assessment, 
599-615 
Parents 
as behavioral change agents, 
540 
Pattern
anomalies, 267-68
in perception research,
266-69 
see also Illusions 
Pattern vision 
lack of 
in morphological change, 
384 
Peer group influence 
in drug use, 46, 55-56 
in male attitudes towards 
school, 106 
Pentylenetetrazol 
effects on memory and 
learning, 77 
Perception 
of angles, 266-67, 270 
applications, 279-80 
aspects of the world, 266-
72
color, 269-70 
pattern, 266-69 
size and distance, 270-72 
and attention 
in children's learning, 98- 
99 
conclusion, 281 
context, 272-75 
contingency-aftereffects
paradigm 
and feature detection, 277 
cross-cultural studies of, 
268-70, 272 
drawing inferences from 
data, 264-66 
figural-aftereffects paradigm,
265 
methodology, 280-81 
response bias, 280-81 
in operative thought, 195 
perceptual mechanisms, 
275-79 
backward masking, 278 
feature detectors, 275-78 
prismatic adaptation, 278-
79
of personal causation
attitude studies, 430 
phenomenological aspects 
of, 281 
and psychophysiology 
in adult development, 77- 
79 
salience studies, 101 
shape and depth 
developmental studies of, 
109
spatial-frequency analyzers, 
278 
theoretical background as 
bias 
in clinicians, 632 
visual, 263-81 
introduction, 263-64 
visual size estimation 
in schizophrenia, 598-99 
Perceptual differentiation
EEG measure of
sex differences in, 407-8 
Perceptual discrimination 
and racial prejudice, 120 
Perceptual judgment 
imitation of 
among children, 113 
Perceptual-motor develop- 
ment, 108-10 
Perceptual processing 
in adult memory studies,
76 
in hearing and aging studies, 
77 
Performance testing 
for personnel, 483 
Personal Data Form 
see Tests and scales 
Personality, 393-411 
adult development, 69-70, 
79-82
and attitude research, 416 
biology of, 403-6 
conclusion, 410-11
continuity models of, 80 
creative 
studies of, 3 
development 
social learning approach 
to, 80 
developmental dimensions of 
discussion, 395 
differences 
among therapists, 520-21 
ego development 
and moral development, 
399-401 
and emotions, 408-10 
extroversion and EEG, 404- 
5 
hypnotic responsiveness 
correlates of, 27-28 
introduction, 393-94 
methodological alternatives 
in, 410 
reorientations in theory and 
method, 394-98 
person, situation, inter-
action, 395-96 
review of research, 399-410 
sex differences in, 401-3 
and social development, 112- 
25 
structure 
and adult development, 81- 
82 
style, 406-8 
variables 
and age difference studies, 
82 
in attitude research, 438- 
39 
in dissonance studies, 425 
Personality change 
continuity model of, 356 
in psychoanalysis 
Malan studies, 514 
Menninger study, 512-14
Personal Orientation Inventory
see Tests and scales
Personnel attitudes
goal setting and management
by objectives, 465-68
the Herzberg theory
and job enrichment, 468-
71
job satisfaction and perfor-
mance, 473-75
and motivation, 457-76
expectancy theory, 458-65
introduction, 457
pay and monetary incentives,
471-73
Personnel selection
classification and placement,
481-501
see also Personnel testing
Personnel testing
APA standards, 483
and civil rights, 486-94
content validity, 484
criteria, 485-86
general treatments, 482-
83
guidelines
government regulations,
487-88
job analysis, 483-84
job element examining, 485
job performance prediction
ethnic differences in, 488-
89
methodological developments,
494-97
in the seventies, 483
specific selection techniques
and areas, 497-501
Personology
see Personality
Person perception research
and personality, 80
Perspective
actor/observer differences
in attributional studies,
429-30
problems
and child studies, 195
Peyote
ritual consumption of, 46
Phantom limb
treatment through hypnosis,
33
Phenothiazine therapy
in high redundancy schizo-
phrenics, 601-2, 612
Phenylketonuria
and prevention of mental
disorders, 573
Phobias
behavior therapy in, 526
desensitization in treating,
527-31
treatment through hypnosis,
34
Photometry
heterochromatic
comparison of methods,
265
Photoperiodic time measure-
ment (PTM)
in seasonal cycles, 151-54
Photopigments
and circadian rhythms, 142-
43
Photoreceptors
and circadian rhythms, 141-
42
Physiology
effect on intellectual perfor-
mance
in adult development stud-
ies, 73
Piaget
formal operations
in logic and thought, 191
influence on thinking
research, 174
perception and memory
work on, 194-95
Piagetian tasks
and cognition studies, 101
performance of older per-
sons on, 74
Piagetian theory
and transitive inference
research, 188
Pilocarpine
effect on CNS activity, 247
Pineal gland
in birds
as biological clock, 146
role in seasonal reproduc-
tion, 155
Pitch
discrimination
in perception research,
272
low
in complex sounds, 207-
15
and harmonics, 212-13
Pituitary
role in reproduction cycle
in rats, 159
Placebo
in dissonance studies, 428
effect
of drugs on memory and
learning, 77
response
differing from hypnotic
pain reduction, 29
Plane geometry
psychological
and perception, 266
Poggendorff display
see Tests and scales
Poisson equation
in synaptic plasticity, 379
Political ideology
and moral reasoning
in student activists, 400
Ponzo effect 
in visual perception, 268-69, 
278 
Porteus Mazes 
see Tests and scales 
Position Analysis Question- 
naire 
see Tests and scales 
Poverty 
and mental disorder, 569- 
70 
Preferences 
imitation of 
among children, 113 
Pregnancy 
related to child's mental 
development 
longitudinal study of, 642 
Prejudice 
and attitude research, 416 
Preparatory Interval (PI) 
in schizophrenia studies, 
607-11 
Present State Examination 
see Tests and scales 
Primal therapy 
evaluation of, 516 
Primary Mental Abilities 
see Tests and scales 
Primary prevention 
of mental disorders, 557-78 
definitions, 558-59 
see also Mental disorders 
Problems 
perspective and rotation 
in mental imagery studies, 
195 
three-term series 
classification of, 187-88 
Problem solving 
anagram and other word 
problems, 176 
and cognitive style 
in children, 106-7 
and concept attainment 
in adult development, 74 
educational and industrial 
applications of, 174 
general reviews of, 175 
hypothesis of, 180-81 
incubation in, 175-76 
insight problems, 175-76 
mathematical, 176-77 
models of processes, 177- 
78 
role of memory in, 178 
tower of Hanoi, NIM, and 
other kinds, 177 
Productivity 
effect of marijuana on, 57 
Professional and Administra- 
tive Career Examination 
(PACE) 
see Tests and scales 
Progesterone 
in sexual receptivity
in rodents, 160
Prolactin 
in relation with corticoste- 
rone 
in seasonal cycle studies, 
153 
Prosocial behavior 
and altruism 
in preschool children, 400- 
1 
in moral development, 116- 
17 
Pseudopatients 
in mental hospital evaluation, 
516-17 
Psilocybin 
tolerance for, 53 
Psoriasis 
treatment through hypnosis, 
34 
Psychiatric Case Study form 
see Tests and scales 
Psychiatric diagnosis 
attempts to improve, 643- 
48 
computerized diagnosis, 
643-45 
specified criteria for, 645- 
48 
table of, 646 
Psychiatric epidemiology, 
636-43 
biological or cultural milieu, 
642 
case registers 
use of, 641-42 
cross-cultural assessments, 
641 
cross-racial assessment, 
638-40 
effects of language on assess- 
ment, 640-41 
Psychiatric Status Schedule 
see Tests and scales 
Psychoanalysis 
20-year Menninger study, 
512-14 
Psychoanalytic theory 
as basis of personology, 
396-97 
in hypnosis, 23-24 
revision of 
and women, 10 
Psychogenic hypothesis 
of mental disorder, 562-67 
origin in Freud's work, 564- 
65 
Psycholinguistics 
relation of language and 
cognition, 100 
Psychological and social 
factors 
in mental disorders, 563 
Psychopathology 
abnormal and clinical, 621- 
60 
comparison, 622
biometric approach to,
621-60
and contingent negative 
variation, 245 
hypnosis in 
use of, 35 
psychophysiological approach 
to, 648-56 
Psychophysics 
auditory, 207-22 
Psychophysiology 
correlates of hypnotic 
responsiveness, 28-30 
and individual differences 
in hypnosis, 20 
pain reduction 
correlates of, 29-30 
and perception 
in adult development, 77- 
79 
Psychosocial crises 
intimacy vs isolation 
in college men, 399 
Psychosomatic disorders 
and complex societies, 
572 
treatment through hypnosis, 
34 


Psychotherapy
broad spectrum treatises,
525
child therapy, 524
client characteristics
and process and outcome,
521-22
clinical applications of
hypnosis in, 20
comparative studies, 510-
12
the Di Loreto study, 511
Temple study, 510-11
dynamic 
therapeutic effects of, 519 
evaluation of, 345-47 
hypnosis in, 35 
individual
and behavior therapy, 509- 
43 
interpersonal attraction in 
study of, 350
Malan studies of 
and transference, 514 
methodological issues and 
innovations, 524-25 
outcomes of other methods 
rational-emotive therapy, 
516 
persisting issues, 517-20 
process studies, 523-24 
prognostic indicators, 519 
pseudopatients 
in diagnostic evaluation, 
516-17 
race, class, and sex 
of client and therapist, 
522 
effects on outcome, 522-23
specific techniques and
patients
the Menninger study, 512-
14
therapist, client, and pro-
cess, 520-24
the A-B dimension, 520- 
21 
Truax-Carkhuff measure, 
514-16, 521 
therapist facilitative condi- 
tions 
analysis of empathy warm- 
th, 514-16 
therapists 
comprehensive study of, 
521
vs behavior therapy 
in comparative studies, 
512 
and women, 8-13 
Psychotic behavior 
behavior therapy in, 526 
treatment through hypnosis, 
34 
Psychotropic drugs 
medical use of, 51
Puerto Rican 
college students 
alienation among, 360 
Pulfrich stereophenomenon
in size-distance perception, 
271 
Punishment 
and self-control 
in children, 118-19 
Purdue Pegboard 
see Tests and scales
Purpuric lesions 
treatment through hypnosis, 
34 


Q 


Quantal analysis 
of synaptic plasticity, 378- 
79 


Quick Test 
see Tests and scales 


R 


Race 
in intellectual ability and 
achievement, 101-4 
in therapist-client relation- 
ships, 522 
Racial awareness and attitudes 
in social and personality 
development, 119-20 
Rape 
attribution of fault in 
and attitude research, 432- 
33 
Rational-emotive therapy 
in comparative studies, 
516 


Raven Progressive Matrices
see Tests and scales
Reactance
and attitude research, 436
Reaction time
and aging, 79
in schizophrenics, 607-11
Reading
and perception studies, 273-
74
perceptual components of,
279-80
skill 
in job placement testing, 
485 
in personnel testing and 
civil rights, 493-94 
Reid report 
see Tests and scales 
Reinforcement 
in behavior therapy, 526- 
27 
REM sleep 
in the aged, 78 
Reproduction 
biological rhythms of, 137, 
145, 159-61 
Research designs 
for student development, 
350-52 
Response decrements 
in neuronal plasticity, 377- 
85 


Responsibility 
attitudes of, 432-33 
Retardation 
primary and secondary 
discussion of, 102 
and prosocial behavior 
in children, 116 
in Se depression, 
6 


Retarded vs normals 
in various test comparisons, 
651, 653 
Retirement 
attitudes of aged toward, 83 
Rewards and incentives 
in attitude studies, 430-31 
Rhodopsin sensitivity curve 
and circadian rhythms, 142-
43


Rhythmic behaviors 
effects of light on, 140-44 
Risk taking 
among children 
study of, 107 
RNA 
yeast 
effects on memory and 
learning, 77 
Rod-and-frame test (RFT) 
see Tests and scales 
Rokeach Value Survey 
see Tests and scales 
Role change 
requirements and conse-
quences of, 2
Role-motivation training 
for managerial positions, 
475-76
Role playing 
effect of skill in 
and dissonance studies, 
425 
Role taking 
and communication skills, 
110-12 
Role theory 
in hypnosis, 23 
in moral development 
and personality, 400 
in personality development 
student studies, 356 
Rorschach ink blot 
see Tests and scales 
Rorschach responses 
and hypnosis, 23-24 
Rotation problems 
and imagery in thinking, 
195 
Rotter's I-E scale 
see Tests and scales 
Rules 
learning difficulty 
and concept attainment, 
185 


S 


Schedule for Affective Dis- 
orders and Schizophrenia 
(SADS), 648 

Schizophrenia 

age of onset, 594 
arousal 
in acute vs chronic, 604-6
assessment of, 593-615 
conclusion, 614-15 
cross-cultural assessment, 
641 
cross-national differences
in assessing, 637-38 
as dementia praecox, 593- 
96 
European definition of, 602 
genetic transmission of, 
601-2 
high-low redundancy classi- 
fication, 600-4 
history from 1911 to 1957, 
596-98 
homogeneous subgroupings, 
599-600 
stimulus redundancy form- 
ulation, 600-4 
inheritance of, 659 
International Pilot Study of, 
639, 641
predicting pathology, 606-
14
heart-rate studies, 611-
13
language, 613-14
Waterloo studies, 607-11
premorbid adjustment scale, 
598 
prevention of, 561
process-reactive classifica- 
tion, 596-98 
religious subgroups of, 658 
slow reaction time in, 597 
and social awareness 
behavior therapy, 526 
subclassifications of, 596- 
604 
synaptic mechanisms in, 
659-60 
visual size estimation 
assessment, 598-99 
Schizophrenics 
hypnotic susceptibility 
among, 24, 34 
in test comparisons, 651-53, 
655
therapeutic change in, 516- 
17 
therapist empathy for, 520-
21
Scholastic Aptitude Test 
see Tests and scales 
School programs 
role in preventing mental
disorders, 560 
Seasonal rhythms 
and animal behavior, 151- 
56 
circannual, 154-55 
influences on behavior, 155- 
56 
photoperiodism, 151-54 
Seizures 
aversive conditioning in, 
531 
treatment through hypnosis, 
34 
Self 
and attitude research, 416 
Self-actualization 
and voice quality, 406 
Self-awareness 
and attitude 
research on, 415 
objective 
in attitude studies, 436-37 
Self-concept
and reminiscence 
in age difference studies, 
82


Self-control 
and behavior therapy, 540 
in children 
development of, 114-19 
and family interaction, 122- 
23 
observation in increasing
and behavioral analysis, 
628 
Self-disclosure 
in therapy, 340-42 
of volunteer subjects, 420 


Self-esteem 
as personality variable 
in attitude study, 438-39 
Self-help behavior 
behavior therapy in, 526 
Self-management training 
in behavior therapy, 538 
Semantic desensitization 
aversive conditioning in, 
533 
Semantic differential scale 
see Tests and scales 
Semantics 
in attitude research, 417 
Sensation 
and biological rhythms, 
161-62 
and illusion 
in adult development, 77- 
78 
Sense impressions 
as encoding features in 
memory, 313 
Sensitivity training 
as psychotherapy substitute 
for women, 9 
Sensitization 
neural changes causing 
in invertebrates, 372 
as precursor of associative 
learning, 377 
Sensory deprivation 
and age difference studies, 
78 
as modifying hypnotic sus- 
ceptibility, 27 
Sequential studies 
of adult personality develop- 
ment, 80-81 
Sex 
differences 
in adult development, 74 
in aversive conditioning 
for smoking, 533 
in cognitive-perceptual 
styles, 407-8 
and contingent negative 
variation, 245 
in Fear Survey Schedule 
scores, 626 
in intellectual development 
etc., 104-5 
in personality, 401-3 
in self-disclosure in 
therapy, 341 
in social development, 
120-21 
discrimination 
Congressional hearings 
on, 4 
education 
role in preventing mental 
disorders, 560 
of model 
in aggression studies, 118 
mortality differentials
biologically caused, 12
in therapy outcome
role of, 523
Sex-role 
conception 
model of, 3 
definition 
androgynous, 3 
development and personality,
80
and fear of success, 6-7
identity
studies of, 3-4
psychological studies of, 1-
5
socialization and typing
and family, 402 
standards 
MF measures, 3 
typing 
and achievement in child- 
ren, 105-6 
and birth order studies, 
125 
measures of, 3 
variation 
in other societies, 2-3 
Sexuality 
attitudes of aged toward, 
83 
Sharing behavior 
in children 
studies of, 116-17 
Shock 
electrical 
use in aversive condition- 
ing, 531-33 
Sibling influences 
on social and personality 
development, 124 
Signal detection 
methods 
in psychiatric assessment, 
651-53 
task 
in memory, 305 
theory 
in perception, 280 
Signature 
and personality style, 407 
Significance Batting Averages 
see Tests and scales 
Situation 
in personality research, 
395-96 
Skill 
use of hypnosis in investiga- 
ting, 34-35 
Sleep 
and activity 
as biological rhythms, 
158-59 
disturbances in depression, 
631-32, 634 
and evoked response research, 
236-37, 245-46 
interaction with hypnosis, 
32 


REM and hypnosis, 32 
talking 
under posthypnotic sugges- 
tion, 32 
SMOG index of readability 
see Tests and scales 
Smoking 
affect-based typology of, 
410 
aversive conditioning in, 
532-33
behavior therapy in treating, 
528 
control through hypnosis, 33 
covert reinforcement in 
treating, 538 
personality factors in, 404 
Smell 
elicited by evoked response, 
241 


Snakes 
fear of, 527, 534, 536 
Social age 
in adult development studies 
measurement of, 70 
Social class 
and cognitive abilities, 102- 
4 
differences 
in infant's language abilities, 
99-106 
in mother-infant interac- 
tions, 122 
as factor in intellectual 
ability and achievement, 
101-4 
Social development 
and personality, 112-25 
Social disability 
and poverty, 570 
Social interaction 
and egocentrism 
age difference studies, 
82 
imitation of 
among children, 113 
Socialization 
and achievement in women, 
early 
and sex-role identity, 3 
and personality, 80 
Social judgment theory 
involvement and attitude, 
435-36 
Social motivation 
use of hypnosis in investiga- 
ting, 35 
Social norms 
perception of 
and reference group expec- 
tancies, 418 
Social-personality area 
in adult development studies, 
69 
Social psychology
attitudes and opinions,
415-43
of experiments 
and attitude, 419-20 
research methods in, 416
theory and method 
criticisms of, 416 
Social responsibility 
sequential study of, 80-81 
Social stimuli 
in circadian rhythms, 144 
Sociocultural 
change 
in aging studies, 68-71 
differences 
in adult intelligence scores, 
73 
Socioeconomic factors 
in age difference studies, 
82 
in family interaction studies, 
122-23 
Somatosensory evoked 
responses, 237, 241, 243,
245-47 
Sophrology 
relationship to hypnosis, 
36 


Sound 
complex 
low pitch of, 207-15 
detection of, 217-19 
experiments 
see Auditory psychophysics 
lateralization of, 220-21 
localization of, 220 
subjective duration of, 219 
Spatial reference systems 
developmental studies of, 
109-10 
Speech 
inner and private 
in thinking, 193 
interaction 
in psychotherapy, 523 
internal 
and thought, 175 
in loudness experiments, 
215 
and memory 
phonemic encoding, 296- 
300 
and motor activity in child- 
ren, 101 
and reading 
relationship between, 279 
of schizophrenics and nor- 
mals 
in auditory feedback tests, 
655 
as stimuli in evoked response
research, 241 
style-changing 
among children, 111 
Spiegel scale 
see Tests and scales 
Stanford Hypnotic Susceptibil- 
ity scale 
see Tests and scales
Statistical theory
of low pitch, 215
Statistics
Friedman-Rubin clustering
method, 636

Status seeking
among women,


Stein's Symbol Gestalt 
see Tests and scales 
Stereotypes 
and attitudes 
in adult development and 
aging, 82-83 
sexual, 4-5 
Sternberg choice reaction-
time memory-search task
see Tests and scales 
Steroids 
effects of 
on hypothalamic-pituitary 
axis, 160 
Stimulants and depressants
patterns of use 
frequency amount and 
duration, 54-55 
prevalence and trends, 50- 
51 
Stimulus Incongruity theory 
motivational 
in attitude studies, 438 
Stimulus redundancy 
in schizophrenia, 600-4 
Stress 
environmental 
and mental disorders, 571- 


and life events 
in precipitating mental 
disorders, 643 
Stroop Color Word Test 
see Tests and scales 
Structured Clinical Interview 
see Tests and scales 
Student activists 
relationship of moral reason- 
ing and political ideology, 
400 
Student development, 350-61 
alienation 
cross-racial study, 360 
attitude and personality 
variables, 354 
breakdown of the student 
population 
dropouts, 357 
and counseling, 337-61 
disadvantaged students, 358- 
59 
dormitory living 
impact of, 354 
finances and college atten- 
dance, 359-60 
low achievers 
studies of, 360 
low income students, 360 
marriage effects on atten-
dance, 359
methodological issues, 350-
52
minority students, 359-60 
older freshmen 
studies of, 360 
philosophical issues, 354-57 
research 
state of, 361 
residents vs commuters, 
358-61 
statistical issues, 353-54 
teaching techniques, 354-55 
veterans 
studies of, 360 
Stuttering 
aversive conditioning in, 
531 
hypnotic treatment, 34 
rational-emotive therapy in, 
516 
shock treatment in, 542 
Success 
and failure 
sex differences in, 7 
Suicide prevention, 575 
Swedish children 
aggression and "mobbing" 
behavior, 118 
Syllogistic reasoning 
and concept attainment, 
189 
Symbiosis anxiety 
and masculinity development, 
12 
Synaptic 
decrement 
and transmitter depletion, 
380-81 
mechanism 
in mental disorders, 659-
60
plasticity
behavioral relevance of,
367-85
in neurophysiology of learn-
ing, 378-85


T 
Tasks 
memory 
age deficits in, 76 
Piagetian
and cognition studies, 101
pitch-discrimination 
in perception research, 
272 
in problem solving process- 
es, 177-78 
rotation perspective 
and perceptual role-taking, 
112 
transfer 
age differences in, 76 
Taste
elicited by evoked response,
241-42 
Teaching techniques 
in college 
and student development, 
354-55 
Temperament 
and personality 
role of biology in, 403-5 
Temperature cycles 
in circadian rhythms, 145 
Temple study 
comparing psychoanalytic 
and behavior therapies, 
510-11
Temporal change 
in visual perception 
role of, 275 
Temptation 
and dissonance studies, 
425 
Tennessee self-sort 
see Tests and scales 
Terminal drop 
as predictor of death, 70- 
71 


Testicular development 
in seasonal cycles 
in animals, 153
Testing 
behavior-analytic view of, 
624-25 
better patient performance, 
645-55 
and civil rights, 486-94 
differential validity 
job performance predic- 
tions, 488-89 
major summaries of, 481 
mental patients 
problems of, 649-52 
personnel selection, classi- 
fication 
and placement, 481-501 
see also Personnel testing 
Tests and scales
American College Test (ACT)
and college dropouts, 357
Army Alpha 
in adult development stud- 
ies, 72 
in personnel selection, 489 
Asch conformity task 
in personality research, 
399 
assertiveness scales 
in assessing behavior 
therapy, 539 
Babcock-Levy
and early age-difference 
studies, 74 
Barber Suggestibility Scale 
(BSS), 26 
in hypnotic induction, 21 
Barrett-Lennard Relation- 
ship Inventory 
in therapist empathy, 515 


Berkeley Guidance Study 
on adult personality, 81 
follow-up study at midlife, 
74 


Bernreuter Self-sufficiency
Questionnaire 
in behavioral analysis, 
625 
Brown-Peterson paradigm 
in memory study, 295, 
298, 302, 306, 322 
California Personality 
Inventory 
follow-up of, 80 
sex differences in person- 
ality, 402 
in therapy research, 340 
Cattell 16 Personality Fac- 
tors 
generational studies of 
women, 81 
in student development 
studies, 355 
Cloze Test 
and field independents, 
407
College Characteristics 
Index (CCI)
and student development, 
356-57 
College and University 
Environment Scale (CUES) 
and student development, 
356-57 
Cornell JDK 
and job attraction, 473 
Draw-a-Man test
and age differences, 74 
Edwards Personal Preference 
Schedule 
and college dropouts, 357 
Elgin scale 
in process-reactive schizo- 
phrenia, 598 
Eysenck Personality Inventory 
in personnel selection, 
498 
Fear Survey Schedule 
in assessing behavior 
therapy, 539 
in behavioral analysis, 
625-26
Federal Service Entrance 
Examination 
in personnel selection, 
490 
General Aptitude Test battery 
and age differences, 74 
in personnel selection, 498 
Gordon Personal Profile 
and Personal Inventory 
in personnel selection, 
498 
Graduate Record Examina- 
tion (GRE) 
in personnel selection, 
490-93 


Group Therapist Orientation 
Scale 
in counseling research, 
341 
Halstead-Reitan
in LSD users, 57 
Health-Sickness Rating Scale 
in Menninger study of 
psychoanalysis, 513, 524 
hypnotic responsiveness 
scales, 25-26 
group alert scale, 26 
Imagery measures 
in assessing behavior 
therapy, 539 
International Personnel Man-
agement Association series
in personnel selection, 490
IQ tests
conflict over use, 102
in dissonance studies, 425 
Jung's word association test 
in schizophrenia assess- 
ment, 596 
Junior Professional Assis- 
tant exam 
in personnel selection, 490 
Kelly Rep Test 
in schizophrenia assess- 
ment, 596 
Matching Familiar Figures 
test 
problem solving strategies 
in children, 106 


Maudsley Personality Inven-
tory
in behavioral analysis, 625


Michigan State Affective 
Sensitivity Scale 
in counselor training, 344 
Minnesota Multiphasic Per- 
sonality Inventory (MMPI) 
in adult personality studies,
81
and hypnotic responsive-
ness, 27
related to vocal styles,
406
in therapy evaluation, 340,
511


Müller-Lyer display 
illusion in schizophrenia, 
500-1 
in visual perception, 264, 
268 
Myers-Briggs Type Indica- 
tor 
and short-term memory 
tasks, 398 
National Merit Scholarship 
Qualifying Test 
and minority students, 359- 
60 
National Teachers Examina- 
tion 
discrimination in, 493 
Navy Maze test
in personnel selection,
498 
Navy Vocational Interest 
Inventory 
in personnel selection, 
498 
O'Connor Finger Dexterity 
Test 
in personnel selection, 498 
Personal Data Form 
in behavioral analysis, 
627-28 
Personal Orientation Inven- 
tory 
related to voice quality, 
406 
Poggendorff display
in visual perception, 267- 
69 
Porteus Mazes 
in personnel selection,
498
Position Analysis Question- 
naire 
in job analysis testing, 
484 
Present State Examination 
in schizophrenia investiga- 
tion, 657 
translation of, 640 
Primary Mental Abilities
in adult development stud-
ies, 72-73
Professional and Adminis- 
trative Career Examina- 
tion (PACE) 
in personnel selection, 490 
Psychiatric Case Study form
in behavioral analysis, 
628 
Psychiatric Status Schedule 
translation of, 640-41 
Purdue Pegboard 
in personnel selection, 
498 
the Quick test 
on age differences, 74 
Raven Progressive Matrices 
and birth order studies, 
124 
Reid report
in assessing honesty, 501 
Rod-and-frame test (RFT) 
in perception research, 
274-75 
Rokeach Value Survey 
in attitude study, 439 
Rorschach ink blot 
development of, 596 
in hypnosis, 23-24 
related to vocal styles, 
406 
Rotter's I-E scale 
and coping mechanisms, 
475 
Schedule for Affective Dis- 
orders and Schizophrenia 
(SADS), 648
Scholastic Aptitude Test (SAT) 
and disadvantaged students, 
359 
scoring posthypnotic amne-
sia, 31
semantic differential scale 
use in attitude research, 
417 
Significance Batting Averages 
in personnel testing, 485- 
86 
SMOG index of readability 
in personnel testing, 493 
Spiegel scale, 25 
Stanford Hypnotic Suscepti-
bility Scale, 25-26 
Stein's Symbol Gestalt 
and age differences, 74 
Sternberg choice reaction-
time memory-search task
in cognition and personal 
style, 408 
Stroop Color Word Test 
and age differences, 74 
and perception interference, 
273 
Structured Clinical Interview 
in multilingual study, 641 
Tennessee self-sort 
generational difference 
studies, 82 
Thematic Apperception Test 
(TAT) 
age and achievement moti- 
vation, 82 
Thurstone categorization task 
in attitude research, 417- 
Truax Accurate Empathy 
Scale 
in therapy research, 339 
Truax Empathy scale 
correlation with similar 
scales, 515 
Truax Nonpossessive Warmth 
and Genuineness 
in therapy research, 339 
Truax scales 
in therapist evaluation, 
510-11 
USES General Aptitude Test 
battery 
in personnel selection, 
491 
Venn diagram 
in logic, 189
Wason's four-card problem 
in conditional reasoning, 
191 
Wechsler Adult Intelligence 
Scales (WAIS) 
in adult development stud- 
ies, 72 
and age differences, 74 
and physical health, 73 
Wechsler Memory Test 
and age differences, 74 
Willoughby Personality
Schedule 
in behavioral analysis, 625 
Witkins' field dependence- 
independence 
in relation to intelligence, 
74 
Wonderlic 
in personnel selection, 
489 
WWI Army General Classi- 
fication Test 
in personnel selection, 489 
Thalamus 
and contingent negative vari- 
ation, 242 
in evoked response, 240, 
243 
Thematic Apperception Test 
(TAT) 
see Tests and scales
Therapeutic communities 
in drug treatment, 59 
Therapist facilitative condi- 
tions 
Rogers-Truax-Carkhuff
analysis of empathy etc., 
514-16, 521 
Therapists 
comprehensive study of, 
521 
interpersonal characteristics 
of 
importance of, 512 
Therapy 
behavior vs psychotherapy 
in comparative studies, 
512 
and counseling 
review of, 337-50 
substitutes 
for women, 9 
Thinking 
and concept attainment, 173- 
76 
concept identification, 179- 
87 
introduction, 173-74 
language and thought, 192- 
96 
logic and thought, 187-92 
monographs and general 
surveys of, 174-75 
problem solving, 175-78 
definition of, 173 
recursive 
as role-taking task, 111
Thurstone categorization task 
see Tests and scales 
Time 
intervals 
in low pitch of complex 
sounds, 208 
Time-perspective 
and the elderly 
studies on, 82 
Time-sense
and biological rhythms
in various animals, 162 
Tones 
frequency shifts in 
implications of, 212 
Touch/vision coordination 
and prismatic adaptation, 
278-79 
Training 
and classification 
in children, 99 
and conservation 
in children, 99 
Trait 
personality 
discussion of theory, 395- 
96 
Trance 
controversy over, 20-22 
Trimethadione 
effect on CNS activity, 247 
Truax Accurate Empathy 
Scale 
see Tests and scales 
Truax Nonpossessive Warmth 
and Genuineness 
see Tests and scales 
Truth tables 
use in concept identification 
research, 185-86 
Twin studies 
and achievement striving, 
105 
cerebral potentials in, 243 
and hypnotic susceptibility, 
26
Kallman-Jarvik 
reports on survivors, 74 
and temperament, 403-4 
visual evoked response 
hereditary factors in, 
240 


U 


Ugandan villagers
in Ponzo illusion study, 268
Urban stress
and attitude research, 416
USES General Aptitude Test
battery
see Tests and scales
US-UK Diagnostic Project
in psychopathology, 637-39


V 


Venn diagram 
see Tests and scales 
Verbalization 
ineffectiveness in altering 
behavior 
of children, 116 
Verbal learning and memory, 
291-327 
see also Memory 
Verbal mediation
in thinking, 194
Verbal reasoning ability 
and family size, 124-25 
Vision 
seasonal changes in sensiti- 
vity, 161-62 
Visual 
age difference studies, 78 
function 
declining with age, 77-
78
illusions 
in adult development stud- 
ies, 69 
pathways 
and circadian rhythms, 
143-44 
evoked response research, 
243 
perception, 263-81 
see also Perception 
preference 
in infants, 109 
recall tasks
and adult memory, 76 
response
and electrophysiology, 243 
system 
of cat, 384 
changes in synapses in, 
384 
thinking 
and problem solving 
research, 174 
Visual evoked responses (VERS), 
238-40, 243, 245-47 
brightness, frequency, and 
movement, 238-39 
changes with maturation 
and aging, 240 
color and spectra, 239 
pattern and contour, 239 
in schizophrenic arousal, 
606- 
species differences, 240 
to word stimuli, 239-40 
Visualization 
as stable through adulthood, 
73 
Visuo-motor flexibility 
as decremental, 73 
Vocational training 
as psychotherapy substitute 
for women, 9 
Voice 
role of in personality style, 
406 
Voyeurism 
aversive conditioning in, 
531, 533 


W 
Warts
treatment through hypnosis,
34
Wason's four-card problem
see Tests and scales
Wechsler's Adult Intelligence
Scales
see Tests and scales
Willoughby Personality
Schedule
see Tests and scales
Witkins' field dependence-
independence
see Tests and scales
Women
achievement motivation in,
1, 5-8
emotional development in
effect of father absence on,
124
fear of success in, 6-7
feminine development
biosocial theory of, 2
hypnosis and creativity, 27
low income
in student development
studies, 360
personality studies of, 81
personnel testing
and civil rights, 491-92
psychology of
introduction, 1-2
selected topics, 1-13
and psychotherapy, 8-13
and self-disclosure in
therapy, 341
sex roles
psychological studies of,
2-5
sex-typed behavior in 
encouragement of, 106
therapy substitutes for, 9 
Wonderlic 
see Tests and scales 
Word Fluency 
and aging studies, 73
WWII Army General Classi-
fication Test
see Tests and scales 


Z 


Zambian infants 
in spatial ability studies, 
Zinacanteco Indians 
mother-infant interaction 
studies, 121 